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PREFACE 


The plan for Statistics in Psychological Research was developed, in outline 
and detail, over a six-year period, during which the author was a member of 
the faculty of the Department of Psychology at Pennsylvania State University. 
In twelve semesters, nearly five hundred advanced undergraduate students 
and graduate students were enrolled in a statistics course which was taught by 
the author and which served as the testing ground for the methods and 
materials of the text. The students were primarily majors in psychology, but 
students from education, speech, home economics, civil engineering, physical 
education, mathematics, and mineralogy constituted a substantial proportion 
of the enrollment. In addition, numerous members of the faculty from 
various departments of the University audited the course. `~ 

The text is intended for a one-semester course in psychological statistics. 
It may be used as an introductory text for advanced undergraduates and 
graduate students. It may also be used as a second- or intermediate-level text 
for psychology majors in general. It covers the standard topics, but concen- 
trates on critical issues rather than some minor or peripheral matters. 
Emphasis is given to the distinction between descriptive and inferential 
statistics, between experimental and non-experimental research, and between 
testing significance and estimating parameters. 

Considerable space is devoted to developing the concept of psychological 
statistics as an important part of the language of the research psychologist. 
To assist the reader in developing this concept, chapters are identified as 
referring to syntactics, semantics, or pragmatics. Thus an attempt has been 
made throughout the text to organize the discussion of statistical issues 
around certain fundamental notions about the study of language. This 
language approach to the study of psychological statistics assists the student 
in recognizing the nature of a given issue and its resolution without introduc- 
ing any additional assumptions into the discussion or placing any restrictions 
on the standard treatment of the subject. 


vi 

Dr. Kendon R. Smith and Dr. Lewis R. Aiken, Jr., presently my colleagues 
at The Woman's College of the University of North Carolina, made many 
helpful criticisms and suggestions. 

I am indebted to the authors and to the publishers, Oliver & Boyd Ltd., 
Edinburgh, for permission to reprint Table No. III from Statistical Methods 
for Research Workers by Professor Sir Ronald A. Fisher, and Table No. III 
from Statistical Tables for Biological, Agricultural, and Medical Research by 
Professor Sir Ronald A. Fisher and Dr. Frank Yates. 

Finally, I must give credit to my students whose expressions of interest 
and enthusiasm over the past six years were a constant source of encourage- 
ment in bringing this project to completion. 


W. S. R. 
Greensboro, North Carolina 
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STATISTICS IN PSYCHOLOGICAL RESEARCH 


LANGUAGE, SCIENCE, 
AND STATISTICS 


We shall begin with a definition. 

A language is a system of interrelated signs employed by the members of 
a social group for purposes of communication. 

In the totality of human language, one can discern numerous overlapping 
Systems and subsystems which vary considerably in size and complexity, and 
which are combined in various ways to suit the purposes of their users. The 
familiar, extensive sign systems of everyday life, such as English, French, 
German, and Russian, are often called natural languages to distinguish them 
from certain specialized and restricted languages. One specialized and 
restricted system of signs which is of interest to us here is what might be 
called the /anguage of empirical science. The language of empirical science 
can be described, in general terms, as consisting of technical language 
of various sorts along with some parts of one or more natural 
languages. 

Progress in empirical science, as the modern world has known it, would 
have been impossible without language. To achieve new additions to one's 
knowledge of natural phenomena, observation, a basic component of em- 
pirical research, must be converted to data which can be accumulated, 
analyzed, and reported. Language is necessary in recording and storing data, 
in organizing and systematizing data, in describing and predicting natural 
processes and events represented by the observations, and in generalizing and 
communicating the final results. 

Although mathematics is a language in itself, it also constitutes an 
important part of the more complex language system of empirical science. 
Mathematical language is a system of signs whose interrelations are unam- 
biguously and completely determined. It is a language remarkably adapted 
to the formation of relations among signs and the transformation of these 
relations. As a vehicle for logic and deduction it is unequaled by any other 
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language system. In mathematics very complex and extended operations upon 
certain initial sets of signs will yield numerous other sets without error or 
controversy. It is, finally, the language in which the scientist sets forth his 
propositions with the greatest precision. 

Mathematical statistics, a branch of mathematics, is also a language. (The 
suffix “ -ics " of the word “ statistics," as the term is employed at this point, 
indicates a branch of systematized knowledge and the term is construed as 
singular in number.) Mathematical statistics is a system of interrelated signs 
which includes certain special symbols in addition to the common symbols 
of arithmetic, algebra, geometry, calculus, and probability theory. It shares 
in the deductive perfection of mathematics. 

Mathematical statistics has developed rapidly and extensively in the 
present century. It has become an important part of the language of the 
biological and behavioral sciences. The significance of this development is 
indicated by the fact that the formal design and analysis of research in these 
Sciences are now largely guided and determined by statistical theory. 
Psychology and education share this concern with and dependence on 
statistics. 


The Study of Language 


There are numerous ways of analyzing and describing language, but the 
approach which seems particularly appropriate and valuable to the present 
discussion is that of Rudolf Carnap and Charles W. Morris, whose scholarly 
contributions in this area are widely known. (Selected writings by these 
authors are listed at the end of Chapter 1 as suggested readings.) We wish 
to indicate that the discussion which follows is based to a considerable 
extent, but not entirely, upon the writings of Carnap and Morris. In our very 
brief and introductory treatment of language analysis, we shall make no 
pretense of being highly technical and complete, and we shall not try to 
preserve certain fine distinctions which can be made between the views of 
various authorities in this area. 

Three points of reference can be employed in the analysis of any language. 
They are the sign, what is referred to by the sign, and the response 
of an interpreter made directly to the sign and indirectly to the sign's 
referent. 

The sign is the unit of language. Signs may be letters, numbers, words, or 
marks such as “ ? ", ** / ", and “ +”. Signs may be combined, arranged 
in sequence to form expressions, and composed into sentences. 

In general, what is referred to is an object! a property of an object, a 
relation between properties, a function involving relations, or a complex 


*When we say “ object," we intend “ object or event.” 


A 
1 


The Study of Language 3 


situation involving objects, properties, relations, and functions.? What is 
referred to is said to be designated by the sign and, in consequence, is called 
the designatum of the sign. 

In responding to a sign, an interpreter takes account of the designatum. 
Any attempt at a complete explanation or understanding of the response to a 
sign would have to include, as possible factors, the personality of the inter- 
preter and the nature of his group. Making the response of an interpreter a 
point of reference for the analysis of language has the effect of assigning a 
place of central importance in that analysis to the interpreter's personal 
characteristics and the social context within which he operates. 

A comprehensive investigation of a language would deal with signs and 
their interrelations; the designation of objects, properties, relations, functions, 
and situations by signs, expressions, and sentences; and the interpreter of the 
signs with the totality of his personal and social behavior associated with the 
use of the signs. Studies of language can, of course, vary in their comprehen- 
siveness. The terms “ syntactics,” “ semantics," and “ pragmatics " are em- 
ployed to distinguish three levels of comprehensiveness in language analyses. 
Syntactics is the study of language in which only the signs and their inter- 
relations are abstracted and analyzed, Semantics refers to language analysis 
in which signs and their designata are abstracted and investigated without 
reference to the interpreter or user. Pragmatics is the language study in which 
extensive reference is made to the interpreter or user of the language. 

Taking the sign, the designatum, and the response of the interpreter as 
three points of reference, we can characterize the three kinds or levels of 
language study by three simple relations. In syntactics the abstracted relations 
of signs to signs are analyzed. In semantics the relation between the sign and 
its designatum is the main object of study. In pragmatics the relation of 
principal interest is that between sign and user. 

Syntactics has to do with what one would ordinarily think of as the 
fashioning of letters, spelling, constructing sentences, and reasoning logically. 
In mathematics, formulas and equations are sentences; proofs and derivations 
are examples of deductive reasoning. Calculation is a special form of 
deduction applied to numerical expressions. 

A well-developed language has three sets of syntactical rules: rules which 
provide the means of introducing new signs; rules which indicate how signs 
can be combined, placed in sequence, and composed into sentences; and rules 
which guide deduction, the obtaining of a sentence from one or more other 
sentences, When expressions or sentences conform to these three kinds of 


_*For example, the object might be a ten-year-old child, the behavioral event might be 
his reading of several pages of test material. The property of interest to a psychologist 
might be the child’s speed of reading. A relation between properties might be shown by 
comparing the reading speed of one child with the speed of another. A function might be 
established for predicting differences in reading speed from differences in spelling accuracy 
for a large number of children. 
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syntactical rules, they are considered to be correct. When they do not conform 
in some respect, they are considered to be incorrect in that respect. 

Semantics is concerned primarily with designation. Several levels of 
designation can be distinguished. The lowest level is the relation between 
an object and a sign which simply directs attention to or points to the 
object. 

At a higher level, properties of objects, relations between properties, and 
functions are designated by signs or combinations of signs. Sentences which 
designate relations, functions, or situations are called propositions. 

At the lowest level of designation the assignment of signs to designata is 
often arbitrary.? These semantical definitions pose no special problem and, 
at the same time, add nothing to empirical knowledge. Although certain 
designations may be arbitrary, expressions and propositions which incor- 
porate them may be true or false.* Qualities of truth and falsity can be deter- 
mined only by experience and observation. Success in acquiring empirical 
knowledge is largely a matter of formulating true propositions. (Later we 
shall prefer using '' confirmable," a relative term, in place of " true," an 
absolute term.) 

At the higher and more complex levels of designation, the meaningful 
assignment of signs, expressions, and sentences to designata such as pro- 
perties, relations, functions, and situations is seldom, if ever, arbitrary.» The 
designation presupposes some kind of correspondence or isomorphism 
between signs and their designata. Verification of the correspondence is again 
only the yield of experience and observation. In particular we wish to 
emphasize that the development of an elaborate syntax, which may be 
entirely correct within itself, has no necessary consequence for semantical 
verification. 

There are other special signs which connect or indicate relations between 
signs but do not themselves designate objects, properties, relations, or 
functions. Examples of these special signs are the English words: “ all,” 
" and," “any,” * are," “every,” “if,” “is,” and “ not." There are still 
other signs which indicate the reaction of the user to the situation being 
described. In the following sentence, “ unfortunately ” indicates the reaction 
of the author of the sentence. “ Unfortunately, everyone agrees that statistics 

?Given three distinguishable objects and the three labels—RIL, ZUP, and NEM, the 


labels could be assigned as names for the objects in any arbitrary fashion as long as these 
labels were then used consistently in later communications. 

‘The words, * farmers," ** milk," and “ cows,” may have been assigned to their designata 
arbitrarily, but in our everyday experience only one order of these words produces a true 
proposition. 

"Scoring a set of test papers may yield a set of numbers, each of which is assigned to a 
student. Students may then be compared with respect to these scores. Furthermore, the 
scores might be employed in a specific mathematical equation for predicting other measures 
of performance. No part of this procedure would be considered meaningful by a psychologist 
if the assignment of numbers was arbitrary. Nor would the prediction be considered of. any 
value if the choice of the equation was completely arbitrary. 
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is misused, but few can agree on what specific practices actually constitute 
misuse.” We shall not attempt here a discussion of the designata of these 
special signs which either connect other signs or indicate the user’s reaction. 
Our reason is that the discyssion would go beyond the reasonable scope of 
these brief, introductory considerations and would not be especially profitable 
for the student of psychological statistics. 

We have already stated that success in acquiring empirical knowledge is 
largely a matter of formulating true semantical relations but that verification 
itself depends on experience and not on the logical manipulation of signs. What 
is the nature of the extremely important and critical process of verification? 

The question does not have a simple answer. We can say, in general, that 
one must observe the semantical rules of the language in question. A 
semantical rule is one which determines the conditions under which a sign 
or an expression is applicable to a designatum. In other words, a semantical 
rule provides the means of establishing correspondence between signs and 
what they refer to. The problem of correspondence in the case of attention- 
directing signs is not difficult. The semantical rule is that the sign designates 
what is pointed to. Much more difficult is the problem of correspondence 
for signs, expressions, or sentences which designate properties, relations, 


functions, or situations. 

In the development of logical empiricism, three views on the problem of 
Verifying propositions have been prominent. One view is that scientific 
Statements are verified by means of immediate experiences. That is to say, 
Propositions can be reduced to so-called primitive sentences about which 
there can be no question. A second view is that a proposition is checked only 
by incorporating it in a system with other propositions. These criterion 
Propositions have been called protocol sentences and are defined as direct 
Statements of observations as typically employed by natural scientists. A 
third view is that the base of all empirical knowledge is the ultimate reduction 
of all propositions to attention-directing and pointing, a process called 
substantiation. A 

Carnap has tried to synthesize these three views. He stressed the importance 
of distinguishing between the terms * truth " and “ confirmation as applied 
to propositions. Carnap maintained that truth is an absolute concept and 
that confirmation is a relative one, degrees of which vary with the progress of 
any science. He also distinguished between directly and indirectly testable 
Propositions. A directly testable proposition is one which, on the basis of 
observation, can be regarded as confirmed or disconfirmed. An indirectly 
testable proposition is one which is tested by directly testing other proposi- 
tions which are related to it. The testing operations include checking a pro- 
Position against observation and checking a proposition against other 


Propositions which have already been confirmed. 


"See Joergensen (6). 
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Pragmatics, the study of the relation between signs and the user of signs, 
includes and depends on syntactics and semantics but goes beyond them. A 
language cannot be adequately described, nor can its functions be completely 
explained, on the basis of its syntax and its semantical relations. Language as 
behavior is intimately involved with the attitudes, values, and judgments of 
the users. Usages may develop because of their convenience for the user, 
their effectiveness in communication, or their demonstrated practical 
benefits. 

Common-sense considerations enter pragmatics in the supposedly sound 
and valid judgments underlying the thinking, reasoning, and argument which 
play very important roles in the practical affairs of men. It is certainly true 
that some of what passes for common sense is not sound and valid. It is 
possible, too, that part or all of what is actually sound and valid judgment 
could be shown to be so by syntactical analysis with respect to the deductive 
processes of the language. Although common sense conclusions might, in 
theory, be reducible to formal deduction, it is quite impractical to do so asa 
general thing. In everyday life there is insufficient time to reduce all of one's 
judgments to the syntactical level of evaluation. Furthermore, only a few 
highly specialized scholars could do the logical analyses in those instances in 
which they are possible. Most of us believe that many important problems 
can be solved and their solutions communicated satisfactorily in language by 
an application of sound judgment on the part of one or more individuals 
without recourse to a formal analysis; and we act accordingly. Consequently, 
it seems appropriate to study, in pragmatics, the important role of common 
sense as it is exercised by the user of the language. 

Convention and habit are also important factors in language behavior. At 
any given point in time in any social group, certain ways of speaking and 
writing are taken for granted and used uncritically. If a member of the group 
does question a specific construction or idiom, he is often at a loss to explain 
it or account for it except as a convention. Even so, it would not be justifiable 
to conclude that conventional and habitual usages have completely arbitrary 
origins. They may have originated in the attitudes, values, and judgments of 
members of the social group of an earlier period. 

Corresponding to the sentence of syntactics and the proposition of 
semantics is the expectation of pragmatics. An expectation is the response 
of an interpreter to a proposition initiated by himself or another 
person. Expectations are fulfilled or not fulfilled in the experience of the 
user. 
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Psychological statistics, as we wish to consider it here, is a language 
consisting of mathematics, psychological terms and other technical English 
words, and words employed in formal English for speaking and writing. It 
is a language employed by psychologists in research, the objective of which is 
the acquisition of knowledge about the behavior of organisms.’ The signs of 
psychological statistics include numbers, the letters of the Latin and Greek 
alphabets, mathematical symbols, and words. The designata of psychological 
statistics are behavioral events or products, their properties and relations, 
taken singly and in groups produced by sampling. The interpreter in psycho- 
logical statistics is the research psychologist as he responds to and makes use 
of the signs. 

Our problem is to give an account of a specialized language by means ofa 
natural language. The specialized language is psychological statistics. The 
natural language is English. We must employ English to introduce and 
describe psychological statistics. The fact that the two languages overlap 
somewhat has both advantages and disadvantages. Common features of the 
two will sometimes facilitate our efforts at communication. On the other 
hand, important discriminations in psychological statistics are occasionally 
made difficult by the relatively loose usage which characterizes the ordinary, 
natural language. 

We shall not make a great fuss about this distinction between the specialized 
language and the natural language. It may be sufficient to say simply that 
we hope to proceed from everyday, classroom and textbook English, by a 
kind of process of successive approximation, to the statistical language of the 
research psychologist. The transition from the one to the other will be 
gradual and the dividing line between the two will often be imperceptible. We 
take it for granted that the transition will not be without discontinuities and 
that much backtracking will be necessary. 

We propose dividing the study of psychological statistics into three parts: 
syntactics, semantics, and pragmatics. This proposal deserves amplification 
and explanation in a number of respects. 

Syntactics, semantics, and pragmatics as studies of different dimensions of 
the same language are the results of abstraction. In no instance does the 
process of abstraction yield three perfectly discriminable disciplines. There is 
some natural and unavoidable overlap and interdependence. Furthermore, 
one does not have to try very hard to make this overlap and interdependence 
the cause for confusion. For example, treating as semantical problems those 
designations for which the designata are themselves linguistic abstractions is 

7Behavior is defined here as meaning a movement, an action, or a vocalization on the 
akg a living organism. Behavior is public, not private; that is, it is open to objective 
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not a profitable line of activity for our purposes. It is true that signs them- 
selves might be taken as designata, or the syntactical relations among signs 
might be so considered, or states of the user, or pragmatical issues, or 
semantical issues, or properties of properties. But enough of this proliferation 
of abstract designata. Common sense tells us that, if we continued, we could 
soon reach a level of complete absurdity. 

How the study of psychological statistics can be divided into syntactics, 
semantics, and pragmatics is not immediately obvious upon first considera- 
tion of the subject, a circumstance which will very likely provoke in the student 
some skepticism as to the desirability of doing so. Our previous experience 
indicates that, at many points, the distinctions among the three separate 
studies can be made and maintained with ease and confidence on the part of 
the student. At a few points the distinctions may be difficult and may appear 
to be arbitrary. The advantage which derives from the many relatively clear 
distinctions far outweighs the disadvantage of the few unclear ones. 

What is the advantage of dividing psychological statistics into three 
separate studies? We can answer this question by considering the kinds of 
problems encountered in the general study of the subject and how these 
problems can be grouped under the three headings: syntactics, semantics, and 
pragmatics. We shall see that these three groups of problems possess clearly 
distinguishable features and require for their solutions quite different 
approaches. The important advantage of the analysis, then, is that it brings 
into sharp focus the different kinds of problems and the peculiar nature of 
the solution appropriate to each kind of problem. 


EXAMPLES OF SYNTACTICAL PROBLEMS 


Recall that syntactics is the study of the relations of signs to signs. Although 
in psychological statistics we have problems of English syntax as well as 
mathematical syntax, we shall assume that students possess an acceptable 
level of competence in the former and that it will, therefore, be profitable to 
focus upon problems in the latter. Listed below are examples of problems 
which belong in syntactics. In these examples no attempt has been made to 
stay within the prior experience of the student, a circumstance which should 
not alarm him for issues of this kind will be dealt with in later chapters as 
it becomes necessary and appropriate to do so. 

1. Perform the arithmetic operations indicated below. 


42 x 21 + 31 x 56 + 29 x 38 
2. Given the equations, 
Bı + Borie = Fis, 


Biriz + Êe = rog, 


solve for B, and f, in terms of r12, ry3, and rss. 
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3. Transform the equation, 
Y'= bX + (Y — b,2X), 


by translating the axes so that the origin is at the point, (X, Y) 
4. Find the value of M, which minimizes the value of the expression 


2 (X,— Mj. 


5. Given a discrete population distribution for the variable X, find the 
mean of the sampling distribution of means of random samples of two values 
when the sampling involves replacement. 

The five problems given above are entirely devoid of empirical content. 
One has only to know the rules for handling the symbols and the solutions 
can be readily obtained. 

No doubt some of the five syntactical problems given above will be 
difficult for individuals who have a limited background in mathematics. In 
the face of such difficulties, beginning students in psychological statistics 
often imagine that they would be helped if they were given examples of 
applications. The fact is that applications almost always introduce new issues 
and complexities without resolving any of the original ones. Students who 
have difficulty with any of these problems will only overcome that difficulty 
by applying themselves diligently and with concentration to the syntactical 
rules involved in the solutions. It is possible, of course, that they need a 
review of mathematics or further training in that subject. 


EXAMPLES OF SEMANTICAL PROBLEMS 


Consider next the area of semantics in psychological statistics. Recall that 
semantics is the study of sign-object relations in terms of the confirmability 
of the designations. Again we must acknowledge that the nontechnical verbal 
portions of statistical language give rise to semantical problems but, as in the 
case of syntactics, we shall assume that students use everyday English with 
reasonable regard for the semantical rules and that the important matter here 
is sign-designatum correspondence in two areas: measurement and statistical 
inference. 

While a number of different levels of sign-designatum relations can be 
conceptualized, we shall limit our consideration here to one of these levels. 
In our discussion, what is designated is not itself a sign, a combination of 
signs, or a property of signs. The designatum is, rather, some physical, 
biological, psychological, or social attribute of an individual or a group. 
That is to say, what is designated is some aspect of behavior and not language 
or any part of language, formally so considered. 

For the behavioral scientist, two important kinds of designations can be 
distinguished. First there are the designations implied by measurement, the 
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process of quantifying observation. Psychological measurement is the designa- 
tion of the properties of behavioral events by means of numbers. Second there 
is the designation of the error variability in measurements. For example, 
specification of the sampling variability in measures of behavioral events is 
the designation of an empirical sampling distribution by means of a mathe- 
matical function. 

These two kinds of semantical relations, measurement and the specification 
of error variability, should be preeminent in the study of psychological 
statistics because the confirmability of these designations is of critical 
importance for the ultimate use of statistics in research. The behavioral 
scientist's chances of success in acquiring knowledge through his research are 
increased considerably if the research is carried out under conditions of 
confirmed measurement and confirmable specification of error variation. One 
could, of course, extend the notion of confirmability to the final results of 
research. Since an account of the use of statistics and the interpretation of 
results in research involves to a considerable extent comment on the users, 
we shall not treat the problems of confirming research results as semantical 
problems. Instead, we shall limit our concern with confirmability in semantics 
to the two intermediate issues: measurement and the specification of error 
variation. 

Listed below are five problems in psychological measurement. 

1. A student participates in the javelin throw in the track and field events 
at his school. His best throw under the standard conditions for the event is 
a distance of 35.45 meters. Opposite his name in the record of the meet there 
is written, as a measure of his performance on that occasion, '* 35.45 m." 
Can this method of assigning numbers to behavioral events be justified ? 

2. A candidate for pilot training is given a test of visual acuity. Two 
parallel black lines drawn to standard dimensions and located on a white 
background are presented as visual stimuli by means of a device which makes 
it possible to vary and measure, in millimeters, the separation between the 
lines. The candidate is instructed to report, on each presentation of the 
stimulus, whether or not the lines appear to be separated. The examiner 
determines the size of the separation which is detected correctly in 50 per cent 
of presentations. The result is that the candidate is given a measure of visual 
acuity of 11 millimeters. Does this procedure qualify as measurement ? 

3. An arithmetic test is administered to the pupils in the fifth grade of an 
elementary school. The test consists of 25 problems. The answers given by 
each pupil are classified as right or wrong and his correct answers are then 
counted. The number of correct answers is the score given to the pupil. Do 
the properties of the numbers correspond to the properties of the test 
performances? 


4. Ten applicants for a supervisory position are interviewed by three 
personnel psychologists. Each psychologist then rates every applicant on four 
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scales. The four scales are intended to measure important dimensions of the 
applicants’ behavior. A scale is represented by a horizontal line on which are 
located five, equally spaced points, numbered “ one " to “ five " from left to 
right. Each point is also labeled with one or two words intended to assist the 
rater in making his judgment. The twelve ratings from four scales and three 
judges are summed for each applicant. The sum constitutes the applicant's 
combined rating. Does this sum qualify as a measure of the characteristics 
of an applicant, as intended by the interviewers ? 

5. A clinical psychologist obtains 100 items, each item consisting of a 
statement which may be used by a mother in describing her behavior with 
respect to an infant son. He instructs a client to order these items in terms of 
the way they describe her relations with her son, the order running from those 
“ most like " herself to those “ least like " herself. He then asks her to order 
the items a second time as an ideal mother would have performed the task. 
The two orderings are then correlated. The product-moment coefficient of 
correlation for this particular client is 0.55. The psychologist reports the 
coefficient as a measure of agreement between the client’s self-concept and 
her ideal self-concept. Is this measure of agreement confirmable? 

The five semantical problems posed above all have to do with the observa- 
tion of behavior and ways of quantifying observation. In each example 
numbers are assigned to behavioral events. The purpose of the assignment is 
to characterize and record properties of the events. One certainly cannot 
assume that the assignment of numbers by any arbitrary system whatever 
will automatically achieve the goal of characterizing the properties of the 
events. Furthermore, there is nothing in mathematics or any other language 
which makes it possible to deduce evidence of confirmation or disconfirma- 
tion. Confirmation can only be achieved by experience. 

A comprehensive treatment of psychological measurement would go far 
outside the scope of a single text and would overlap the content of numerous 
courses in the conventional course offerings of a department of psychology. 
What we shall attempt in this book is a brief outline of the fundamentals of 
measurement. This discussion of the objective conditions under which the 
degree of correspondence between properties of numbers and properties of 
events designated by the numbers can be determined will be given in Chapter 
5. For the immediate present, we shall say only that these conditions have to 
do with the reproducibility and predictiveness of the assigned numbers. 

We have not yet finished with our examples of semantical problems. There 
is the other important class involving the specification of error variability, of 
which three examples are given below. 

1. A student has 10 coins. The two sides of each coin can be readily 
identified as “ head " and " tail." He tosses the 10 coins, then counts and 
records the number of heads. He repeats the tossing and counting procedure 
until he has 1,200 counts. He notes and records the number of times he 
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counted zero heads, one head, two heads, and so on. He then converts these 
frequencies to proportions of the total number of counts. 

He is then told by his teacher that the terms in the expansion of the 
binomial, 

(p +9)", 
when p — q — 0.50, will approximate the proportions he obtained. 

What justification is there for this prediction ? 

2. A machinist's apprentice is tested for his accuracy and precision in 
measuring the diameter of a standard metal rod using a micrometer. He is 
required to make a lengthy series of measurements in which the standard is 
alternated with other rods whose diameters vary somewhat but not enough 
to be detected by a visual inspection. His results for the standard rod are 
summarized in a frequency distribution. It is observed that this distribution 
is roughly symmetrical around a central value with relatively few values at 
either extreme and a relatively large concentration of values in and near the 
middle. A statistician sees the data and tries to fit a normal curve to them. 
The fit is considered quite good. He makes note of the fact that the apprentice's 
variation in performance is " normally distributed." Is this correspondence 
an accident or is it evidence of the operation of some general law of 
behavior? 

3. A teacher of statistics makes up a frequency distribution of 100,000 
numbers. The frequency distribution has the form of a normal curve except 
that it possesses a finite number of class or score intervals. The teacher then 
assigns students in his class a sampling exercise. He instructs them in the use 
of a device called a table of random numbers to draw a sample from the 
artificial frequency distribution. The students, working together, obtain 1,000 
samples, each one consisting of 10 values. The mean of each sample is com- 
puted and a new frequency distribution of the 1,000 means is constructed. 
The instructor predicts in advance that the form of the distribution of means 
will be approximately that of a normal curve. How does he know what the 
outcome will be? 

The semantical problem of specifying the form of sampling distributions 
will be discussed at some length in Chapter 13. In this preliminary considera- 
tion, we shall say only that confirmation of the correspondence between à 
mathematical function and an actual distribution cannot be deduced. We 
must look to experience for the verdict. 


EXAMPLES OF PRAGMATICAL PROBLEMS 


Let us turn now to pragmatics. The term “ pragmatics " probably suggests 
to most students the study of practical applications. Although this inter- 
pretation of the study of sign-user relations is not irrelevant, it has the fault 
of being an oversimplification and might hinder the development of more 
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meaningful notions if we did not at once caution the student regarding this 
possibility. 

The danger is that a student, when he comes to pragmatics, may think that 
all he has to do is memorize a list of statistical methods and corresponding 
applications. At a very low level of sophistication and understanding his task 
might be so construed. But there is much more than imitation in employing 
statistics in research and consequently much more than memorizing in 
pragmatics. 

Let us illustrate with three issues. 

1. Summarization of data is one application of psychological statistics. 
One might look upon this summarizing process as a condensing of the 
original data to a form which makes them more convenient to handle but 
which lacks the original detail. There are, of course, degrees of condensation. 
One could reduce a mass of data—let us say 10,000 observations—to a single 
value, or to two values, or three, or one hundred, or whatever number one 
chose. And what does one choose? What degree of condensation is desirable? 
How much detail can one afford to lose? The answers to these questions will 
depend on a complex of considerations arising out of the practical situation 
and relating to current practices among research psychologists. 

2. A common problem for a research psychologist is the choice of an 
equation to express a relation between two variables. Equations vary from 
simple to complex. Ideally an investigator would want to use the equation 
which was correct or best, in some sense, regardless of complexity. Practically 
he must take account of the inconvenience and expense involved and he will, 
therefore, often accept a simple equation as an approximation and reject an 
intractable, but possibly more exact, one. Unfortunately there is no simple 
rule for comparing a loss in predictiveness with gains in convenience and 
economy. Consequently, there is a fair amount of disagreement among 
psychologists in the choice of approximations of various kinds. Although one 
may justifiably deplore this state of affairs, there is no point in insisting on 
perfection in these matters. It is in the very nature of research that one must 
be willing to employ approximations even when they are subject to somewhat 
arbitrary selection. 

3. In examining the results of an experiment, we recognize that error may 
have produced the result. We agree to a standard evaluation procedure which 
will ensure our being misled by error on no more than one in twenty occasions, 
in the long run. Why do we resign ourselves to being mistaken in our evalua- 
tions? Why do we choose one-in-twenty as the ratio and not some other? 
What uncertainty is revealed in this final projection into the future—* in the 
Jong run "? The answers to these questions involve a mixture of common 
sense, convention, and experience. They will be amplified in later discussions. 
rlier to the correlatives: the sentence of syntactics, 


A reference was made ea 
s, and the expectation of pragmatics. We say that 


the proposition of semantic 
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a sentence is correct or incorrect, a proposition is confirmed or disconfirmed, 
and an expectation is fulfilled or not fulfilled. What constitutes the fulfilment 
of an expectation? The most direct answer is that, if the researcher employs 
correct syntax and confirmed semantical relations, his immediate expectations 
are fulfilled in the acquisition of empirical knowledge. 

Although the acquiring of knowledge is a prime goal of the researcher, 
there is more to be said about the user of statistics and his expectations, in a 
broader sense. Students are confronted with what they often perceive as a 
burdensome chore in studying psychological statistics. What are the con- 
comitants and products of this learning experience, in addition to the very 
fundamental development of research skills? 

The first important outcome of studying and mastering psychological 
statistics is that one comes to understand research issues and to know that 
one understands them. Understanding and knowing that one understands 
these issues are necessary preconditions for intellectual independence and 
authority, for successful self-direction, and for a personally healthy and 
socially acceptable degree of confidence in one's work. 

The second important outcome is that one comes to be understood by one's 
colleagues and to command agreement and assent, on an intellectual level, in 
the resolving of research issues. 

The third important outcome is that one becomes a better teacher of 
psychology and psychological research methods in the classroom, the 
laboratory, and the field. 


Looking Ahead 


In anticipation of what lies ahead, let us consider some of the prominent 
features of the plan for dividing psychological statistics into syntactics, 
semantics, and pragmatics. 

The plan does not involve a simple division of the subject matter into three 
parts, to be presented in three successive sections of the book. There is some 
appropriate and necessary alternation of the three approaches. Furthermore, 
they do not receive equal emphasis and elaboration. 

Since the syntax of arithmetic and algebra is very important in psycho- 
logical statistics, exercises and suggestions for review of these elementary 
mathematical subjects are given in Chapter 2. 

Another syntactical topic, the symbolism of sums, is given a fairly detailed 
treatment in Chapter 3. 

Before we can continue with the outline, we must introduce a few terms. 
It was pointed out earlier that the word “ statistics," construed as singular, 
indicates a branch of knowledge. The meaning of the word “ statistic," con- 


Looking Ahead 15 


strued as singular, deserves comment. We shall not use the word “ statistic ” 
without a qualifying prefix. Any value computed from a set of measures will 
be called a “ d-statistic " because it is, to some degree, descriptive of the set. 
By definition, a d-statistic does not involve the notion of error and, in a 
formal sense, nothing can be said about its error. The plural form of 
d-statistic is obtained by adding “ s." 

Any value computed from a set of measures subject to random variation, 
as in computing from the measures in a sample drawn from a population, 
will be called an * i-statistic ” because certain inferences may possibly be 
based on that value. An i-statistic has descriptive value with respect to its 
sample; a d-statistic does not have inferential value for a population. The 
plural of i-statistic is obtained by adding " s.” 

While the qualifications represented by the prefixes, “ d- " and “ i," are 
widely known and understood, these prefixes are not in general use. The 
distinction between two corresponding branches of systematized knowledge, 
descriptive statistics and inferential statistics, is, however, frequently made by 
statisticians. 

The thirteen chapters from 4 through 16 are classifiable, as follows: 


Chapter Topic 
Syntactics 4 A class of values called moments 
7 A class of values called product-moments 
8 Partial and multiple correlation 
10 The analysis of variability in classifications of 
numbers 
11 Sampling theory 
12 Sampling distributions 
Semantics 5 Measurement 
13 Practical random sampling 
Pragmatics 6 Descriptive uses of moments 
9 Descriptive uses of product-moments 
14 Statistical inference in research 
15 Tests of significance in experimental research 
16 Tests of significance and estimates of parameters 


in non-experimental research 


Chapter 17 deals with syntactical and pragmatical issues relating to the 
application of statistical inference to frequency classifications. 

Note that two chapters, 6 and 9, are devoted to descriptive statistics, while 
four chapters, 14 through 17, deal with inferential statistics. 

Because we believe that learning is facilitated if the student keeps in mind 
the division of the subject into syntactics, semantics, and pragmatics, each 
chapter or section devoted solely to one of the three kinds of study is so 
designated by an appropriate sub-heading. 
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Syntactics | 2 


Students are sometimes handicapped in statistics by their failure to recall, 
or to grasp the significance of, certain very elementary principles or rules in 
arithmetic and algebra. It is our purpose here to provide a brief review of 
arithmetic and algebra with focus upon those principles and rules which have 
the most direct bearing on the work to be undertaken later. It is not our 
intention to present a course in arithmetic and algebra. Students who have 
had no previous training in algebra should either set out upon a course of 
self-directed study using a standard text or enroll in a first course in college 
algebra.! 

Exercises will be provided at appropriate points in the review. The student 
should make full use of these exercises for the evaluation of his progress and 
the diagnosis of his difficulties. Answers will be given to the exercises bearing 
odd numbers but not to those bearing even numbers. 


On Arithmetic Operations 


Addition is indicated by the sign, “ + "". 
84-9 


Subtraction is indicated by the sign, * — 
7—5 


1One algebra text which students might find helpful is J. B. Rosenbach, E. A. Whitman, 
B. E. Meserve, and P. M. Whitman, College Algebra (Boston: Ginn, 1958). A book in 
which many psychology and education students have done their reviewing of the funda- 
mentals of mathematics is Helen M. Walker, Mathematics Essential for Elementary 
Statistics, rev. ed. (New York: Holt, 1951). 
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Multiplication can be indicated in three different ways. 
5 x 3or 5(3) or 5:3 


Division can be indicated by either a slant, “ [^",ora bar, * —**. 


9 
9/2 or 2 


The order in which numbers are added does not affect the sum. For 
example, the summation, 


3-484 6 


is usually performed in the following steps which, under ordinary circum- 
stances, would not be written out: 


3 + 4 is equal to 7, 


7 + 8 is equal to 15, 
15 + 6 is equal to 21. 


Note that the numbers were accumulated from left to right. Arranging the 
same numbers in a different order, we have 


8-426223. 
Accumulating, as before, from left to right, we take the following steps: 


8 + 4 is equal to 12, 
12 + 6 is equal to 18, 
18 4- 3 is equal to 21 


which is the same sum obtained from the original order. Any other order of 
accumulating the same numbers would give the same answer. 

Grouping of numbers in addition does not affect the sum. If we use 
parentheses to indicate arbitrary groups whose sums are to be obtained before 
the final summation takes place, we can write: 

8 +11 +16+5 
as 
(8 +11) + (16 + 5) 
which is equal to 
19 + 21 
which is equal to 
40. 


We can write the same summation as: 


(8 + 16) + (11 + 5) 
which is equal to 
24 + 16 
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which again is equal to 
40. 


The order in which numbers are multiplied does not affect the product. 


Given the problem, 1 
5x8x3x4, 


we can multiply from left to right. 
5 x 8 is equal to 40, 
40 x 3 is equal to 120, 
120 x 4 is equal to 480. 


These same numbers can be written and multiplied in a different order, as 


below, 
3x8x4x5, 


but the final product is the same. 
3 x 8 is equal to 24, 
24 x 4 is equal to 96, 
96 x 5is equal to 480. 


Grouping of numbers in multiplication does not affect the final product. 
Again we can use parentheses to indicate the operations which are to be 


performed first. 
3x8x10x6 


is equal to 
(3 x 8) x (10 x 6) 
is equal to 
24 x 60 
is equal to 
1,440 


which is the same result we would get if the numbers were grouped as 


(3 x 10 x (8 x 6 


which is equal to 
30 x 48 


is equal to 
1,440. 


When addition and multiplication occur in the same expression, it is 
necessary to have rules regarding which is performed first. In the following 
the multiplications are performed before the addition: 


3x54+8x64+7x7 


sum of products, 
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is equal to 
15 4- 48 4- 49 
is equal to 
112. 


With respect to the expression, 
5(9 + 7), 


which contains addition and multiplication, there are two correct solutions. 
In the first solution the addition within the parentheses is performed before 
the multiplication. That is, 


5(9 + 7) 
is equal to 
5(16) 
is equal to 
80. 


In the second solution each number within the parentheses is multiplied by 
five and the products are then added. This procedure is an example of 
distribution of multiplication. Thus 


5(9 + 7) 
is equal to 
5x9+5x7 
is equal to 
45 + 35 
is equal to 
80. 


There are also two correct solutions for the problem, 


(2 + 8)(6 + 5) 


In the first solution the additions within parentheses are performed before 
multiplication. Thus 

(2 + 8)(6 + 5) 
is equal to 


(10)(11) 


110. 


In the second solution multiplication is distributed and then addition is 
performed. That is, 


is equal to 


(2 + 8)(6 + 5) 
is equal to 
2x64+2x5+8x648x5 


is equal to 
12 + 10 + 48 + 40 
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is equal to 
110, 


which is the answer obtained in the first solution. 

The order and grouping rules for addition apply also to subtraction and 
to combinations of both operations. 

The order, grouping, and distribution rules for multiplication apply to 
division when the division is expressed as equivalent multiplication by 
employing the reciprocal of the divisor. (The reciprocal of a number is '* one ” 
divided by the number.) 

Examples of division expressed as equivalent multiplication: 


3, 1 
T is equal to 4 (3), 


G + 5 is equal to : (4 4- 5), 


Gr 

(5 + 2) 

Caution: The use of the sign, “+”, to indicate division can yield 

ambiguous expressions where more than one division is to be performed or 

where both multiplication and division occur. We shall not use the symbol. 

Note that the position of the bar or the slant is critical in expressing 
division. Observe that 


, 1 
is equal to G+J (3 + 6). 


3/4 1 a. 6. 3. 
ae is equal to => X is equal to 5 x 45 equal to 20 


5/6 5/6 4 
whereas 
3/4)/5 1 3/4. TT ; 
s is equal tog X wf is equal tog X 5 X gis equal to. 


Signed Numbers 


A signed number is a number with a sign, either “ + " or “ — ”, preceding 
it. A number preceded by “ + ” is a positive number; a number preceded by 
" — ” is a negative number. Signed numbers are also called directed numbers 
or algebraic numbers. 

In working with signed numbers, we sometimes wish to disregard the signs 
and consider only the so-called absolute values of the signed numbers. To do 
$0 we drop the signs and perform operations with the numbers as if they 
were unsigned. Thus the absolute value of 4-9 is 9; the absolute value of —4 


€ 


ISo:72? ) 3 
RAY 
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is 4. The absolute value is also called the arithmetic value or the numerical 
value. The symbol for absolute value is two vertical, parallel bars written as 
|—6|, which is equal to 6, and 
|+8 | , which is equal to 8. 
An expression involving numbers and signs, such as 
+3—1+2—4—2 +06, 


can be interpreted correctly in any one of three ways. The signs can be taken 
as referring to the operations of addition and subtraction involving unsigned 
numbers, or to the operations of addition and subtraction involving all 
positive numbers, as indicated below, 


(+3) — (+1) + (+2) — (+4) — (+2) + (+6), 
or to the addition of positive and negative numbers, that is, as 
(+3) + (—1) + (42) + (74) + (722) + (+6). 
Rules have been established for the addition and subtraction of signed 
numbers. They are listed below. 


1. To add numbers with like signs, add the absolute values and prefix the 
sum with the common sign. 


2. To add numbers with unlike signs, add the absolute values of those 
with positive signs, add the absolute values of those with negative signs, 
subtract the smaller of these two sums from the larger, and finally prefix to 
the difference the common sign of the numbers in the larger sum. 


3. To subtract one signed number from another, change the sign of the 
number to be subtracted and proceed as in addition. 


Rules have also been established for the multiplication and division of 
signed numbers. They are given below. 


1. The product or quotient of two numbers with like signs is positive. 
2. The product or quotient of two numbers with unlike signs is negative. 


In writing expressions involving signed numbers, the signs of positive 
numbers are often omitted but understood to be in effect. 


In multiplication, when the two signs are positive, the product may be 
interpreted as the repeated addition of a positive number. That is, 


6(3) is equal to 3 +3 +3 +3 +3 +23. 


When one sign is positive and the other is negative, the product may be 
interpreted as repeated addition of a negative number. Thus: 


7(—4) is equal to (—4) + (—4) + (—4) + (—4) + (—4) + (—4) + (—4). 
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When both signs are negative, the product is defined as positive, thereby 
maintaining consistency in the distribution of multiplication. Consider the 
problem, 

. —3(7 — 2). 


There should be two correct solutions. In one we subtract within the paren- 
theses before multiplying. Subtraction gives us 
—3(5) which is equal to —15. 
In the other solution, we distribute the multiplication by —3 over both 
numbers within parentheses, obtaining 
(—3)(7) + (—3)(—2). 


The first product has factors with unlike signs and is, therefore, negative. The 
second product has two factors with negative signs. This second product 
must be given a positive sign to yield a final answer of —15 and be consistent 
with the first solution. That is, 

(—21) + (+6) is equal to —15. 


The sign rules for division are justified by expressing the division as 
equivalent multiplication and applying the sign rules for multiplication. 


Geometric Interpretation of Signed Numbers 


A signed number is interpreted as a distance on a scale measured in one 
of two directions. Consider the scale represented below by the horizontal line 


marked off in equal segments. 


bh, —4——4— }— $$ + + _+_+—_+—_ + + 1 
~8 -7 6 —3 4 —à — —1 0 41 42 43 «4 45 46 +7 +8 


Notice there is a point labeled “ 0 ” and other points numbered consecutively 
in each direction, that is, to the left and to the right of the zero point. The 
Point labeled with zero is called the origin. Points to the right of the origin 
are labeled with positive numbers which increase in value as one reads the 
Scale from left to right; points to the left of the origin are labeled with 
Negative numbers which decrease in value as one reads from right to left. A 
Signed number expresses a distance in either the positive or the negative 
direction and locates a point on the scale. The correspondence between a 
Signed number, Y, and a point on the scale, P, is acknowledged by calling 
the number the coordinate of the point and the point, the graph of the number. 
„The length of a segment of the scale between the origin and point P, 
disregarding the direction, corresponds to the absolute value of X. 
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Geometric Interpretation of the Combining 
of Signed Numbers 


An expression containing positive and negative numbers can be referred 
to the scale given above and interpreted as locating a point on the scale. For 
example, the expression, 


35-1424 14 


is interpreted, as follows. 

Start at zero on the scale and measure three units in a positive direction, 
then one unit in the negative direction, then two positive, four negative, one 
positive, and finally two positive. Movements along the scale are diagrammed 
below. They stop at the point, +3, which corresponds to the value of the 
expression after all the operations have been carried out. 


+ + + + + + + + + + + + + + -} 1 
-8 —7 —6 —5 —4 —3 2 —] 0 +1 +2 +3 «4 —5 +6 +7 48 
+3 
— 
=) 
Es 
ae 
i —4 
+1 
—> 
+2 
———- 


Fractions 


A fundamental principle of importance in working with fractions is that 
multiplying (or dividing) the numerator and the denominator of a fraction 
by the same quantity does not change the value of the fraction. 


Example: 

3 icequal t ur lt 3m 

7 is equal to 5(7) is equal to 8(7) 
Caution: Adding or subtracting the same quantity in the numerator and 

the denominator does change the value of a fraction. 


In adding and subtracting fractions we obtain a common denominator and 
then add the numerators. 


Example: 


ET lt 3 3 i 1 H 
3 5 İS equa 91s + 75 ÍS equa to 15 
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In multiplying fractions we obtain the product of the numerators and the 
product of the denominators, and write the former over the latter. 


Example: 
x equal to 
5 7 Is eq 


In dividing fractions we invert the divisor and multiply. 


35 


Example: 
1/3 


x 
3/5 


l " 
is equal to is equal to 


3 9 


3 


Algebraic Symbolism 


Students of psychological statistics soon discover, and are often exas- 
perated by, the fact that practices in writing symbolism vary considerably 
from one author or teacher to another. The brief description of writing 
practices given below should not be taken as a universal, invariant standard. 
It is well for students to develop, at the very beginning, an attitude of adapta- 
bility and flexibility where symbolism is concerned. One of the chopera 
statistician takes for granted as part of his lot in life is the constant adjust- 
ment and readjustment of his frame of reference for the interpretation of 
symbols as he goes from one text or article to another. 

With these cautions a matter of record, we shall proceed to describe 
certain rather general but not necessarily fixed practices in writing symbolism. 

Letters are used, in a variety of ways, to represent numbers. 


Example: Let X represent any number in a set or class of numbers. 


We use different letters for different sets or classes. 

Example: Let X represent any number in one set; let Y represent any 
number in another set. 

Numerical subscripts are attached to letters to permit identification of 
particular numbers within a set. 

Examples: X, refers to the third number in a set; Y; refers to the seventh 


number in another set. The assignment of numerical subscripts may be 
arbitrary, 


Letter subscripts also are employed for purposes of identification. 


Examples: X; refers to the it number in a set, where i is any integer from 
MS I S, refers to a number computed, by means of a certain formula, 
from the numbers in the X set. The symbol, r+, refers to a number computed, 
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by means of a certain formula, from the numbers in the Y set and the Y set. 
Identification is also achieved by means of superscripts. 


Example: We might let Y' (read * Y-prime ") represent any number in one 
set and Y represent any number in another related set, as when each value 
of Y' is computed from a corresponding value of Y by means of a stated rule. 

We sometimes need seconds, such as XY" and Y", as well as primes. 


Capital letters and small letters serve different purposes. 


Example: We might let X represent any number in one set and x represent 
any number in another set obtained by applying a transformation to each X. 


Small letters from the first part of the alphabet often represent constants 
while letters, capital or small, near the end of the alphabet often represent 
numbers in sets as indicated earlier. 


Examples: We might let a represent a particular number whose value 
remains unspecified and Y represent any number in a set of numbers whose 
values may vary but also remain unspecified. 


On occasion a letter is selected by an author or a teacher because it is the 
first letter of some descriptive word or term in a problem but this practice 
is not always possible or desirable. 

Thus far we have referred only to letters of the familiar Latin alphabet. 
The demands of our statistical language force us to go to the Greek alphabet 
for additional symbols. The Greek alphabet is given below. Capitals are on 
the left; small letters are on the right. 


Letters Names Letters Names Letters Names 
A, « Alpha Lu Iota P, p Rho 
B, B Beta K, « Kappa X o Sigma 
Dy Gamma A, A Lambda T, + Tau 

4, 8 Delta M, u Mu Yo Upsilon 
E, € Epsilon N, v Nu $, 4 Phi 
Zt Zeta 5, £ Xi X. x Chi 
Hin Eta O, o Omicron — V, jj Psi 

0, 0 Theta II, z Pi Q, w Omega 


The Greek capital letter, Z, is a very important statistical symbol but it is 
not used to represent a number. As we shall use it here, it will always be 
written before at least one other numerical or literal symbol to indicate an 
Operation of summation to be performed with reference to that other symbol. 
The symbol, X, when it indicates a summation, is called an operator. Com- 


mon practices and rules in summational notation are discussed at length in 
Chapter 3. 
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As revealed by the examples given below, the operations of arithmetic can 
be indicated with letters representing numbers. 


Addition: . X+Y4Z 
p t4 
n +l 
X +a 

Subtraction: x—y 
N-1 
Y—c 
a -b—c—d 
Multiplication: XY (read “ X times Y”) 

aX 
3XY 
a(X 4- Y) 

Division: X/Y (read “ X divided by Y”) 
a[ X 
(X + Y)/N 
(X — a)/5 


It is important to keep in mind that an indicated sum, difference, product, 
or quotient, no matter how complex it may be, represents a number which 
can be employed in further operations. 


Equality and Inequality 


The symbol of equality, —, is read ' is equal to." It is used in stating that 
two expressions represent the same number. 
One kind of equality statement is the identity. An example is given below. 


3x — 2) = 3x —6 
This statement qualifies as an identity because the equality holds for all 


values of x. 
A second kind of equality statement is the equation, an example of which is 


3X 4-5 — 29. 


This equation is distinguished from an identity because there are values for 
X for which the statement of equality does not hold. 

A third kind of equality statement is the formula, which states that the 
number represented by one symbol is the same as the number represented 
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by an expression involving one or more other symbols. The following state- 
ment is an example of a formula: 


Z, = (X — MJIS.. 


The symbols of inequality are * < ", which is read “ is less than,” and 
> ”, which is read “is greater than." An inequality symbol appears in a 
statement that the number represented by one expression is less than (or 
greater than, as the case may be) the number represented by another 
expression. 


e 


Examples (with interpretations): 


m<n m is less than n 
P < 0.05 P is less than five hundredths 
amb a is greater than 5 
FE. F is greater than Fs; 
0.01 — P < 0.05 0.01 is less than P which is less than 0.05 


The symbol for nonequality, +, is read “ is not equal to.” It is employed 
in stating that the number represented by one expression is not the number 
epsesente by another expression. 


Example: Yn general, 


XY, + XY x (X, + XXY, 4X 
The following exercises are provided to giv i i 
writing algebraic symbolism. give opportunity for practice in 


Exercise 2.1 


Let a represent a particular num 


b ; 
dioi er. Write the expression for 
2. the number minus 3 
3. the product of the nu 
" : 
; the number divided by D iles 
; the reciprocal of the numb 
2. Wided by the number - 
e 
Ei VK. Multiplied by its reciprocal 
SNY, @ ivi 
^ the Quantity siu a ea y rine 
* the quanti a divided ; Multiplied by the quantity a plus 8 
Y 2, multipli 
"T tins plied by 9 
reise 2 | 
1. 35 
RT 
D te 
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Exercise 2.2 


Let c represent a particular number. Write the expression for 
1. the product of the number and 5 
. the number plus 10 
. the number multiplied by itself 
. the ratio of the number to 100 
- one divided by the number 
. the number divided by 10 
the reciprocal of the number multiplied by twice the number 
the quantity, c minus 3, divided by the reciprocal of c 
the quantity, c plus 3, multiplied by the quantity, c minus 3 
20 times the reciprocal of the number 


O v 0 x0 t BUM 


Exercise 2.3 


Let a represent a particular number and let X;, Xs, Xa, and X; be four numbers in 
a set. Also let X; represent any number in the set where į may be any integer from 
ud 4? 
Write: 
1. the sum of the numbers in the set 
2. a times any number in the set 
3. the product of a and the first number 
4. the product of a and the second number 
5. the third number multiplied by a 
6. a times the fourth number 
7. the sum of the products written in (3), (4), (5), and (6) 
| 8. a times the sum of the four numbers 
9. the sum of the four numbers divided by 4 
| 10. any number in the set minus a 
11. any number in the set divided by a 
| 12. any number in the set multiplied by the reciprocal of a. 


| Answers to Exercise 2.3 


l. Xy + X; + Xs + X. 2. aX; 
3. aX, 4. aXs 
S. Xx 6. aX, 
7. aX, + aX, + aX, + aX, 8. a(Xı + Xs + Xs + X) 
9. (Xi + Xo + Xa + XD/4 10. X; — a 
| ll. X;la 12. (1/2) X; 


Exercise 2.4 


Let c represent a particular number and let Y;, Y», Ys, Ya, and Y; be five numbers 
in a set. Also let Y; be any number in that set. We define i as taking any integral 
value from “ 1 " to “ 5” inclusive. 

Write: 

1. the sum of the numbers in the set 
2. the sum of the products obtained by multiplying each number in the set by c 
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. c times the sum of the numbers 
. the sum of the numbers divided by 5 
. any number in the set minus the reciprocal of c 
. any number in the set divided by c 
. any number in the set multiplied by the reciprocal of c 
. the sum of the quotients obtained by dividing each number in the set by 5 
. the sum of the products obtained by multiplying each number in the set by 
the reciprocal of 5 
10. any number in the set minus the quantity, 3 times c 
11. any number in the set minus the quotient obtained by dividing the sum of 
the numbers by 5 
12. the sum of the five numbers multiplied by itself. 


iO 00 ON tA HW 


Exercise 2.5 


Let a represent a particular number. Let X, and Y, be a pair of numbers, X, and 
Y, be a second pair, X, and Y, a third pair, and X; and Y, a fourth pair. Also 
let X; be the i” number in the X set and Y; be the corresponding number in the 
Y set. 


Write: 
1. the product of X-sub-one and Y-sub-one 
. X-sub-two times Y-sub-two 
. Y-sub-three multiplied by X-sub-three 
the product of the members of the fourth pair of numbers 
. the sum of the four products obtained in (1), (2), (3), and (4) 
. a times each of the four products 
- a times the sum of the four products 
. the sum of X 
. the sum of Y 
10. the product of the sum of X and the sum of Y 
Compare your answer to (10) with your answer to (5). Are they equal? 
11. the product of the members of the it” pair of numbers 
12. a times the product of the members of the ith pair of numbers. 


o 00 MO ta S OD I2 


Answers to Exercise 2.5 


l. MY; 2. X2Y2 
3. XY. 4. XY 
B. AY: X2¥2 + XVY, + XY. 6. aX,Y, 
aXı:Y; 
aX3Y3 
ax, Y, 
d a(G Y, + XY + XY, + XY) 8. M+ Xi XS X. 
9. Yi E: Y t: Y. 4 Y, 10. (X. + X. + X, + XQ) 
(Y; 4 Yi 4 Y, 4 Y4) ; No 
11. X;Y; 12. aX; Y; 


Exercise 2.6 


Let c be a particular number. Let X; be the it? number in a set of five numbers; 
let Y; be the corresponding number in another set. 
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Write: 
1. the product of the numbers in each pair 
2. the sum of the five products obtained in (1) 
3. c times each of the five products obtained in (1) 
4. the sum of the five products obtained in (3) 
5. c times the sum obtained in (2) 
Compare your answer to (5) with your answer to (4). Are they equal? 
6. the sum of X 
7. the sum of Y 
8. the sum of Y divided by the sum of Y 
9. the ratio of X to Y for each of the five pairs 
10. the sum of the five quotients obtained in (9) 
Compare your answer to (10) with your answer to (8). Are they equal? 
11. the sum of the five products obtained in (1), divided by 5 
12. the difference between the members of each of the five pairs 
13. the sum of the five differences obtained in (12) 
14. the sum of the five differences obtained in (12), divided by 5 


Exponents 


When two quantities are multiplied together, each is called a factor of the 
product. When more than two quantities are multiplied together, each one 
or the product of any number of them is also called a factor. l 

An exponent is a number or à literal number symbol written as a superscript 
to a given quantity to express a product of identical factors. The exponent 
indicates the number of identical factors in the product, where each factor 
is the given quantity. That is, by definition, 

(one factor X) 


2 zm 
> = by (two factors X) 
X? = XXX (threes factor x 
Xm = X XXX (m factors X) 
Xm a X Xx:XocXx (n factors X) 


X? is read “ X to the first power”; X? is read "Xto the second power et or 
simply “ Y square”; X? is read “ X to the third power = X cube 3 
X? is read ** X to the m'^ power "; and X" is read “ X to the nt® power. l 
Products expressed by means of exponents may themselves be factors in 
another product. For example, the product of two powers of X, 
X? a 


has two factors, X? and X?, each of which is a product. By definition 


xy: XX 
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and 
XS xS, 
Therefore 
X? Y? = X-X-X- X- XY = X5, 
from which we see that we could have added the exponents “ 2" and “ 3" 
to obtain the exponent “ 5” of the final product. In general, then, 
Xm. Yn — ymo 
Again, by definition, 
(x?) — X?. Y?. Y? 5 XS, 
from which we see that the power of a power can be obtained by multiplying 
exponents. In general, 
(x*)™ = xm, 
Division involving two powers of the same quantity is performed by 


subtracting the exponent of the divisor from that of the dividend, as in the 
example, 


XS KIY 
>= Ss Ci = = 3-2 
x XX Pes 
or, in general, as 
Xm 
x^" = X 


when m > n 0. 
If we extend this practice of subtracting exponents to situations in which 
m = n, we find that 


Given 


we divide the numerator and the denominator by X", Thus: 
E i u 1 u 1 
Am cape 


If we subtract the exponent of the divisor from that of the dividend, we 
obtain 


xm 
— = ym-m — yo 
a = xe, 
Therefore, 
xm 
| = xa Xe. 


By a similar extension to cases in which n > m > 0, we can show that 


= " cx = yin- m) 
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where (n — m) is positive and —(n — m) is negative. Given 

Xn 

X” 
with n > m > 0, we can divide the numerator and denominator by X", 


obtaining 
1 
xn -m 


where (n — m) is positive. But if we subtract the exponent of the divisor from 
that of the dividend in the original fraction, we obtain 
xm -n 
which can be written 
X-in- m 
where (m — n) is negative and equal to —(n — m). 
We can also show that 


I 
= yä 
X di 
1 a 
pari 
and, in general where n > 0, 
l /-—n 
xX << X 
Recall that 
X9 = 1, 
It follows that we can write 
ll x? 
X^ EN 2 


Dividing by subtracting exponents yields 


1 = x = xy6—5u-x-^ 
X" xX 5 
Logarithms are special cases of exponents. It is possible to take a given 


number and express it as a power of 10. Thus 


100 = 10°, 
1,000 = 103, 
10,000 = 10%, 
0.01 = ples 10, 
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and 
1 


io = 1073, 


ll 


0.001 
Also it can be determined that 
345 = 107.5282. 


6,581 = 103.81829 
and 
0.125 


ll 


]0-.90309. 


The exponent in the right member of each of the equality statements given 
above is a logarithm. The quantity 10 is referred to as the base. The number 
which is given expression as a power of 10, that is, the quantity in the left 
member of each equality statement, is called the antilog. 

Examine the relations among the antilogs and the relations among the 
logarithms or exponents. Note that 


100 < 345 — 1,000and 2< 2.53782 < 3; 
1,000 < 6,581 < 10,000and 3 — 3.81829 < 4; 
0.100 < 0.125 — 1.000 and —1 < —0.90309 < 0. 


Logarithms employing 10 as a base are called common logarithms. Another 
System, the so-called natural logarithms, employs as a base the constant e 
whose value is approximately 2.71828. We shall confine references in the 
following discussion to the common system of logarithms. 

If one Keeps in mind that logarithms are exponents, it is not so difficult 
to see that two quantities can be multiplied by expressing each as a power 
of 10 and then adding the logarithms (exponents), that division can be 
performed by expressing the dividend and the divisor as powers of 10 and 
then subtracting logarithms (exponents), and that a power of a given quantity 
can be obtained by multiplying the logarithm (exponent) of that quantity by 
the specified power. These operations are usually put in a form in which the 
base does not appear and one works only with the logarithms (exponents), 
understanding that the base is implied. In each operation, after the com- 
putation has been performed on the logarithms (exponents), the antilog (the 
quantity expressed as a power of 10) must be read from a table in which the 
argument is the value computed from the logarithms (exponents). 

Students who wish to review logarithms and the use of logarithmic tables 


in computing should consult Hodgman (1, 1-9), Rosenbach et al. (2, 369-415), 
or Walker (3, 234-249), 


Roots and Radicals 


Consider the following definition: The nth root of a given number A is the 
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value of each of n equal factors whose product is equal to A. The nt root of 
A is written 
vA. 


Then X is the nth root of A or X = WA if 
A = XX: X--- X (n factors X). 
WA, 


n is the index, 4 is the radical sign, and A is the radicand. 


In the radical, 


Examples: 
The second or square root of A is the value of each of two equal factors 


whose product is A. Then X is the square root of A if 
X-X — A. 
We also observe that the square root of A is A‘/ since 
AW? A1? =A, 
Note that the square root of A is usually written as VA with the index 


implied. 
The third or cube root of A is the value of each of three equal factors whose 


product is A. Then X is the cube root of A if 
XXX = A 
And 41? is also the cube root of A since 
Alls A18. AIB = A, 
Roots are in some instances positive, in some instances negative, and in 
others either positive or negative. By convention 9 is assumed to refer to 


the positive root, +3; —4/9 refers to the negative root, —3. 
Radicals having the same index and the same radicand can be added or 


subtracted. For example, 
42X + 342X = 2X. 

The product of two radicals with the same index is obtained by taking the 

product of the radicands and employing one radical sign. For example, 
¥/5-8/6 = 4/30. 

The product of two radicals with the same radicand and different indices 
is obtained by changing the expressions to a form involving fractional 
exponents and then adding exponents. For example, 


43:4/3 31/2.31/3 33/6 . 32/6 35/6 0/38, 


It is sometimes convenient to rewrite a radical in factored form as 
128 = 424/64. 
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The root of 64 can then be obtained by inspection and the entire quantity 
rewritten as 
84/2. 


Factorial Notation 


The product of all the positive integers from 1 to 5 inclusive, 
Ix2x3xXx4xCX5, 


can be written in a more concise form by means of the following conventional 
notation: 
5! (read “ five factorial ”). 


In general, the product of all the positive integers from 1 to n inclusive, 


DOG) (n — 3) — 2) — Im), 


is written 
n! (read “ n factorial ”). 


The Term in Algebraic Symbolism 


An algebraic expression is made up of one or more terms, a term being the 
product of a number and one or more literal number symbols. The product 
has a prefixed sign. Thus a term is a signed quantity. When the sign is not 
written, it is understood to be positive. 

The numerical factor in a term is called the numerical coefficient. When a 
factor of a term is an unknown or a variable, the other part, which is also a 
factor, is called simply the coefficient of the term. When no number appears 
in the term, the numerical coefficient is understood to be “ 1." 

An expression involving a single term is called a monomial. An expression 


consisting of two terms is a binomial. An expression of two or more terms 
is a multinomial. Here are some examples of each. 


Monomials: NX, —nY?, aX, —3XY 
Binomials: X + Y, x — y, NX? —aXY, 3X3 —2x? 
Multinomials: X? —2XY + Y? 


a+ bX + cX? + dX? 
X? 3XtY 4 3xy24 ys 
p! —3pq + 3pq? — @ 


Technically these expressions are also polynomials but no attempt will be 
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made here to distinguish between the two terms, " multinomial” and 
“ polynomial." 

Terms may be combined when the literal factors are identical. The 
numerical coefficients are combined and written with the literal factor, as 
shown in the examples below: 

3X +2X — 4X = X, 
2X? — 3X? + 5X? = AX?, 
2XY —3XY —2XY — —3XY. 


Exercise 2.7 
Combine terms wherever possible in each of the following expressions: 


1. 3X2 — X? + 4X? 2. 4XY — 5XY + 2XY 
3. 3aX + 2X — 5aX 4. X + aX — 3XY 

5, X9— Xy XY Y* 6. nX — 2nX + 3X 

7. X? — 3X? — 2X? 8. X+ ¥2?+ 2X? — bY? 
9.3! - Y+ X* - 5Y 10. 3X*Y — 3x Y* 


Answers to Exercise 2.7 


1. 6X? 2. XY 

3. 2X — 2aX 4. X + aX — 3XY 
5, X?— 2XY + Y! 6. 3X — nX 

7. —4X* 8. 3X? + Y? — bY? 
9. 4X* + 4Y 10. 3X°¥ — 3x Y? 


Exercise 2.8 


Combine terms wherever possible in each of the following expressions: 


1. 3X + 3X? + 3X? 2. 2XYZ — 3XY + XYZ 

3. 4cY + cY — 9cY 4. aX + cY + AXY 

5 xt. XY XY- Y’ 6. mX 4- b — 5mX 

7. 2Y? 4 3Y* - AY* 8. 4X? + 4Y* + 4Z* 

9. 33 — X2Y — X3Y + XY: — XCY 10. p? + p'q + pa + pa + p'q + pa 
+ XY? XY*— Ys pg +g 


Terms are multiplied by multiplying the numerical coefficients, taking 
account of signs, of course, and applying rules for exponents to the literal 


factors. 


Exercise 2.9 


Perform the indicated multiplication. 


1. (4a)(3a) 2. (NX)(NY) 

3. (NX?) X) 4. (aX)(aY) 

5. GaX)(4aX) 6. (Sa (4XY) 

7. (SaX)(3bX) 8. (abX2)GacX Y?) 
9. (—5X?)(4X) 10. (C4Y)(—3Y) 
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Answers to Exercise 2.9 


1. 12a? 2. N*XY 

3. Nn X? 4. aAXY 

5. 12a? X? 6. 20aXY 

7. 15ab X* 8. 3a?bc X? Y? 

9. —20X* 10. 12Y? 

Exercise 2.10 

Perform the indicated multiplication. 

1. (—5b)(6b*) 2. (nX)(nY) 

3. (mX?)(n X?) 4. (—a)(—5) 

5. (—5aX)(4-5aX) 6. (8c)(cXY) 

7. (6b)(—5c) 8. (—5a)(—55)(—5c) 
9. (2x)(3x)(4x) 10. (6X)(6X?)(6X?) 


One term is divided by another by dividing the numerical coefficients, 
taking account of the signs, and applying the rules for exponents to the literal 
factors. 


Exercise 2.11 


Perform the indicated division. 


1. I0X?/aX 2. 6Y/3 

3. 3XY/X 4. 10a X*/2aX 
5. NX*IN 6. N!'XYINX 
7. n* X? Y/nY 8. SXI 

9. fXY]XY 10. /XY/NXY 
Answers to Exercise 2.11 

1. 10X/a 2 AF 

3, 3Y 4. 5X 

5, X* 6. NY 

7, nX? 8, X? 

9, f 10. SIN 
Exercise 2.12 

Perform the indicated division, 

1. 15aX*/5b 2. 30X Y/20 
3. aX Y/bXY 4. 2bX*/2ax 
5. NXY/N 6. n*[n* 

7. fXIfXY 8. fXIN 

9. fX|fX* 10. fXY/fX 


In multiplying a binomial (or a multinomial) by a monomial, multiplication 
by the single term is distributed over the two terms of the binomial (or all 
terms of the multinomial). 


The Term in Algebraic Symbolism 39 
Exercise 2.13 
Perform the indicated multiplication. 


l. a(X — Y) 2. a(X — a) 

3. a(aX — bY) 4. aX + Y) 

5. a(X + b) 6. «(Y — 2a) 

7. cX(X — Y) 8. bY(X? — Y?) 
9. ac(X* + a) 10. (a — ) XY 
Answers to Exercise 2.13 

l.aX—aY 2. aX — a 

3. àX — abY 4. aX + aY 

5. aX + ab 6. cY — 2ac 

7. cX? — cXY 8. bX?Y — bY’ 
9. acX? + ac 10. aXY — cXY 
Exercise 2.14 

Perform the indicated multiplication. 

1, 3(X + Y) 2. P(a + b) 

3. a(x — y) 4. Bn — 1) 

5. ((« — B) 6. N(N — 1) 

7. N(X? — X«JN) 8. (enn — 1) 
9. p(p — q) 10. p(1 — p) 


Notice that each of the answers to Exercises 2.13 and 2.14 can be viewed 
as a problem in factoring which means resolving a product into the factors 
which when multiplied together yield the product. For example, given the 
expression, 

aX — aY, 
and instructed to factor it, we would examine the expression, note a factor 
common to each term, and write the expression in factored form as 


a(X — Y). 
The student should practice finding the common factor in each of the 


answers to Exercises 2.13 and 2.14. 

Dividing a binomial (or a multinomial) by a mono 
of multiplication. Using the monomial as the divisor, 
buted over each term of the dividend. 


Exercise 2.15 


Perform the indicated division. 
1. (aX? — aX)JaX 

3. (4aX* + 6aY*)/2a 

Answers to Exercise 2.15 


d; 2.X42 
3. 2X* + 3Y? 4. (2+ 3X + 4X93 


mial follows the pattern 
the division is distri- 


2. (3X? + 6X*)/3X? 
4. QX + 3X? + 4X9/3X 
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Exercise 2.16 

1. (15aX — 30aY + 45aZ)/15a 2. (X* 4- X* -- X)/5X 

3. (X*Y + XY + XY?)JaXY 4. QX? + 2X? + 2X)2 X3 

The product of two binomial expressions can be obtained by distributing 
the multiplication by one binomial over each term of the other. Thus 

(X + YXX + Y) 
can be rewritten, after distribution, as 
X(X + Y)+ Y(X + Y) 


which expression now involves two simple problems in the distribution of 
multiplication. The final answer is 


X? + XY + XY + ¥Y?= x? 2VY+ Y 
By the same operations 
(Y +a} = Y? + 2aY + a? 


(X — Y} = X? —2XY + Y? 
and 


(X + YX — Y)= X — XY + XY — Y — Y — ys, 


Notice the form of each product and consider the reverse process, that is, 
resolving these answers into the binomial factors which produce them. 


Exercise 2.17 


Perform the indicated multiplication. 


1. (x + y)? 2. (« — 1X8 — 1) 
3. (n — 1} 4. (X — Mt 

5. (X — aX + a) 6. (a — d)b — c) 

7. (Y + o 8. (3X — 25 

9. (Y — aX(Y + b) 10. N(N — 1)(N — 2) 


Answers to Exercise 2.17 


l. x? + 2xy + y? 2. a—«— p+] 
3. ? —2n 4-1 4. X? — 2M,X + Mè 
5. X? — q? 6. ab — ac — bd + cd 
7. Y! - 2cY + c? 8. 9X? — 12X + 4 
9. Y? — aY + bY — ab 10. N* — 3N? + 2N 


Exercise 2.18 


Perform the indicated multiplication. 


1. (a — (8 — 1) 2. (a — a)(b — B) 

3. (n — 1)(m — 1) 4. (X — M3(Y — Mj) 
5. (u + a)? 6. (1 F rey) — rey) 
7. (€ + YF 8. (Xi + X)? 

9. (X + XXY, + Y; 10. (X + 8)? 


The Term in Algebraic Symbolism 4l 
Exercise 2.19 


You are given two sets of paired numbers, X and Y, with four numbers in each 
set. Write: 
1. the sum of X 
2. the sum of Y 
3. the sum of the sum of X and the sum of Y 
4. the sum of the four pair sums 
Compare your answers to (3) and (4). 
5. the sum of the squares of X 
6. the sum of the squares of Y 
7. the square of the sum of X 
8. the square of the sum of Y 
Compare your answers to (5) and (7); to (6) and (8). 
9. the difference between X and Y for any pair 
10. the sum of the four pair differences 
11. the difference between the sum of X and the sum of Y 
Compare the answers to (10) and (11). 
12. the sum of the squares of the four pair differences. 


Answers to Exercise 2.19 


OX CX XS Xs 

Y, - Yi Yst Yi 

(Xi + Xa+ Xs + X) (E Yat Ys Y) 
Qf, + Y) + (Xe + Y) QG + Y) +X + Y) 
X+ X+ Xi 

i+ Yi + Y} +Y} 


CRI ANAYNS 
x 
a 
+ 


.X;— YjiorX — Y 

10. (X, — Y.) + (X — Ys) + (Xa — Ys) + 06 N) 
1l. (X, + X, + M+ XQ) — (Yı + Ya + Yat Ya) . 
12. (X, — Y. + (6 — Yo? + (Xa — Yt + (Xa — Yo? 


Exercise 2.20 

We are given two sets of paired numbers. Let Is Us, and u, be the three eren 
in one set. Let vı, Va and v; be the three numbers in the second set. We shall add a 
constant value, «, to each number in the u set. We shall also add a constant value, 
B, to each number in the v set. The three numbers in the u set are now transformed 
tou + a, us + a, and us + o. The three numbers in the v set are ire + B, 
Y» + B, and v, + f. Let us refer to the numbers in the new sets as u’ and v. 

Write, in terms of the original numbers and the constants, 


. the sum of u’ 

. the sum of v’ n 
. the sum of the sum of u’ and the sum of v 
- the sum of the three pair sums 

- the sum of the squares of u’ 

the sum of the squares of v 


DAuApAwWn— 
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7. the square of the sum of x 

8. the square of the sum of v’ 

9. the difference between wv’ and v' for any pair 

10. the sum of the three pair differences 

11. the difference between the sum of wv’ and the sum of v’ 
12. the sum of the squares of the three pair differences. 


Equations 


An equation is the statement of an equality which does not hold for some 
values which might be assigned to the literal symbol(s) in the statement. 
When an equation involves one literal number symbol, it is said to be an 
equation in one unknown and the number represented by the letter can be 
determined. When an equation involves two different literal number symbols, 
it is said to be an equation in two unknowns and it is possible to determine 
the number represented by one of the letters if a value is given to the other. 
Thus an equation in two unknowns expresses a relation between two literal 
symbols. When an equation involves three unknowns, the number represented 
by any one letter can be determined if values are assigned to the other two. 


Exercise 2.21 


Write the following as equations. 

. X is equal to 7. 

. The product of 5 and X is equal to 15. 

. X plus 3 is equal to 5. 

. Six times Y is equal to 18 plus 3 times Y, 

. Three less than the product of 5 and X is equal to 32. 

. X minus 10 is equal to X". 

- The difference, Y minus 10, divided by 5 is equal to Y’, 
- The difference, X minus M,, divided by S», is equal to Z,. 
. Y is equal to the square of X. 

- Y is equal to 20 plus the product of 5 and X. 


O00 -10 t€ &UIt-— 


Answers to Exercise 2.21 


L. Am 2. SX = 15 

3. X¥4+3=5 4. 6Y — 18 - 3Y 
5, 5X —3 = 32 6. X — 10 2 x' 

7. (Y 2 10/5 = Y’ 8. (X — M,)/S, = Zy 
9 Y= X 10. Y = 20 + 5X 


Exercise 2.22 


Write the following as equations. 
1. Two more than 5 times an unknown value, X, is equal to 42. 
2. Five less than the reciprocal of the unknown, Y, is equal to 15. 
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3. Twenty-eight times the quotient, N divided by the quantity, N minus 1, is 
equal to 30. 

4. The sum of the four numbers in the X set is equal to 10. 
. 5. The quotient obtained by dividing the sum of five numbers in the Y set by 5 
is equal to 4. 

6. The sum of the squares of the four numbers in the X set is equal to 30. 

7. The sum of the squares of the five numbers in the Y set is equal to 90. 

8. The sum of the third powers of the four numbers in the X set is equal to 100. 

9. The sum of the fourth powers of the four numbers in the X set is equal to 354. 

10. The square of the difference between two unknowns, X and Y, is equal to 25. 


SOLVING SIMPLE EQUATIONS 


In an equation containing a single, literal number symbol, that symbol is 
called an unknown and determining the number(s) represented by the symbol 
is called solving the equation. Each such number is called a solution of the 
equation and is said to satisfy the equation. 

The fundamental principle in solving an equ 
certain arithmetic operations can be performed 
yielding a new equation in which the literal symbol re 
number(s) as it did in the original equation. That is to say, 
In one unknown, we can 

(1) add the same number or expression to both sides, 

(2) subtract the same number or expression from both sides, 

(3) multiply each side by the same number, or 

(4) divide each side by the same number 
and the new equation will have the same solution as the original. 

Caution: There are other operations which must be employed with 
discretion. Multiplying both members by an expression containing the 
unknown may yield an equation in which the literal symbol represents 
numbers, one or more of which will not satisfy the original equation. For 
this reason, all solutions should be checked in the original equation. Dividing 
both members by an expression containing the unknown may result in a loss 
of a solution. Raising both sides to a power poses the same problem as does 
multiplication by the unknown. Extracting the same root on both sides 
presents the same problem as dividing by the unknown. 

An equation in which the exponent of the unknown is “ o 
equation. To solve a linear equation in one unknown: 

1. Clear the equation of fractions by multiplying both sidés by the least 
common denominator of the fractions. 

2. By addition and subtraction, obtain an equation with all terms contain- 
ing the unknown on one side and all other terms on the other side. 

3. Combine terms containing the unknown. 

4. Divide both sides by the coefficient of the unknown. 


ation in one unknown is that 
on both sides of the equation 
presents the same 
given an equation 


ne ” is a linear 
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Here is an example of the solution of a linear equation in one unknown. 
Given the equation, 


2x —4 
y = 


we multiply both sides by 7. The new equation is 
2x — 4 — 28. 
We now add 4 to both sides. The equation becomes 
2x = 39. 
Finally we divide both sides by 2 and obtain the solution, 
E = 16: 


The solution can be checked by substitution in the original equation. 


Given an equation in two unknowns, one can solve for one unknown in 
terms of the other. Given the linear equation, 


2x + 3y = 7, 


one can solve for y in terms of x. First, subtract 2x from each side, thereby 
obtaining a new equation, 


3y = 7 — 2x. 
Next, divide both sides by 3 and find the solution for y to be 
t= 2x. 
daba. 


An equation in two unknowns has an unlimited number of solutions, by 
which we mean pairs of numbers which will satisfy the equation. 


Exercise 2.23 
Solve each equation for the unknown appearing in it. 
1. X¥—5=10 2. 6Y — I8 
be 3Y—8 
3. z 12 4. —g -$ 
5. X* - 3 2 19 6. 2X? — 8 = 42 
7. (X — 3)? = 25 8. Y* 6X +9 = 25 
9. /Y - 6-3 10. /Y-6—J2Y 


Answers to Exercise 2.23 
"PG ub 


! 2.Y-3 

3 ¥ = 21 4. Y — 16 

5 X= 44 6. X — 45 

7. X —8or —2 8. X = 8 or —2 
9. Y=3 10. Y=6 


Equations 
Exercise 2.24 


45 


Solve each equation for the unknown appearing in it. 


1, 5X = 15 
3..6Y — 3Y + 18 


5.5X +2 = 42 


28 N 
1. 77 = 
40 — M: 


g = 


Exercise 2.25 


KIS Fs 

4. 5X — 3 = 32 
1 

6. + — 5 = 15 


8. 1 — r2, = 0.36 


10. (X + 2)? = 25X* 


In each problem solve for the indicated unknown in terms of the other unknowns. 


1. X 42 = SY 
Y = 

3. 2X + 7Y — 18 
x= 

5. (X — Y) = 24 
X= 


7. Y=a+ bx 
X= 

9, x= X = M; 
X= 


Answers to Exercise 2.25 


1. Y — (X + 2/5 
3. X = (18 — 7Y)/2 
5.X¥=Y+8 

7. X =(Y—a/b 
9, X=x+ Mz 


Exercise 2.26 


2. m 4- 1 = 9(n + 1) 
n- 


4.p+q=1 
p= 

6. (x — y} = 36 
FL 

8. X' 2 (X — 50)/10 
X= 

10. Z, = (X — M,)/Sz 
M, = 

2. n = (m + 1)/9 — 1 

4 p=1-q 

6x=y+6 

8. X = 10X' + 50 

10. M, = X — Z.S, 


In each problem solve for the indicated unknown in terms of the other unknowns. 


l. X¥+ a= pY 


5. a(X — Y) — b? 
x= 

7. Y' = (Y — a/b 
Y= 

9.»—Y— M, 
M, = 


2. a — 1 = n(ß — 1) 


B= 

4. 3M, = Xi + Xs t Xs 
M, = 

6. (X — a)? = (Y + by 
X 


8 y — x = (Y — Mj) — (X — Mj 


10. 
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SYSTEMS OF EQUATIONS 


Suppose that we have two linear equations, each involving the same two 
unknowns. It is possible that a pair of numbers which satisfies the one 
equation will also satisfy the other. It is also possible that only one pair of 
numbers will satisfy both equations. When two linear equations are satisfied 
by one and only one pair of numbers, they constitute a system of simultaneous 
linear equations and we can solve for the numbers represented by the letters. 

Here is an example of a system of two simultaneous equations: 


x+y=6 
5x — y = 18. 
One way to solve this system of equations is to perform an arithmetic 
operation which will yield a new equation in only one unknown. Examine 


the two equations given above. If we add the left numbers and then add the 
right members, we obtain 


(Sx — y) + (x + y) = 24, 
or 
6x = 24. 
We can then divide by 6, obtaining the solution: 
x= 4, 
We can now find the value represented by y, by substituting 4 for x in either 


of the original equations and solving for y. Substituting in the first equation 
and solving for y gives 


4+y=6 
and 


y22 


We can check these values, x — 4 and y = 2, by substituting them in the 


original equations to see if each equation is satisfied. Substituting in the 
first equation gives 


44+2=6. 
Substituting in the second equation yields 
5(4) — 2 = 18. 


We observe that the original equations are satisfied by the solutions. 
Given below are three sets of simultaneous equations in two unknowns. 


Note the varied operations which can be employed to eliminate one of the 
unknowns. 
l.a.2x -y —5 
b. x+y=3 


Subtract (b) from (a) to eliminate y in (c). 
€. X —2 
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Substitute for x in (a) or (b) and solve for y. 
yzl 
Check by substituting the two solutions in (a) and (b). 
2. a. 5x — 3y = 21 


b x+y =1 
Multiply (b) by three to obtain (c). 
c. 3x + 3y =3 
Add (a) and (c) to eliminate y in (d). 
d. 8x = 24 
Solve for x. 
xm 
Substitute for x in (a) or (b) and solve for y. 
y--2 


Check by substituting the two solutions in (a) and (b). 
3. a. Tx + 2y = 38 
b. 4x + 3y = 31 
Multiply (a) by three to obtain (c). Multiply (b) by two to obtain (d). 
c. 21x + 6y = 114 
d. 8x + 6y = 62 
Subtract (d) from (c) to eliminate y in (e). 


e. 13x — 52 

Solve for x. 
x=4 

Substitute for x in either (a) and (b) and solve for y. 
pos 


Check by substituting the two solutions in (a) and (b). 

Systems of simultaneous equations in more than two unknowns can be 
solved by similar procedures. With three equations in three unknowns, 
operations are chosen which will yield two equations in two unknowns; then 
further operations are performed to produce one equation in one unknown. 
After solving for this single unknown, we go back, substitute in an equation 
with two unknowns and solve for a second. Then we go back to one of the 
original equations in three unknowns to obtain the value of the third. 
Checking is accomplished by substitution in all three of the original equations. 

Here is a demonstration of the solving of a set of three simultaneous linear 
equations in three unknowns. 

a. 3x —Oy +z =2 

b. x —2y + 3z =6 
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c 2X — y— z— —3 
Subtract (b) from (a) to eliminate y in (d). 
d. 2x — 2z = —4 
Multiply (c) by two to obtain (e). 
e. 4x — 2y — 2z = —6 
Subtract (e) from (b) to eliminate y in (f). 
f. —3x + 5z = 12 
Multiply (d) by five to obtain (g). 
g. 10x — 10z = —20 
Multiply (f) by two to obtain (h). 
h. —6x + 10z = 24 
Add (g), and (h), to eliminate z in (i). 
i. 4x=4 
Solve for x. 
x=1 


Substitute for x in (d) and solve for z. 


Z3 
Substitute for x and z in (a) and solve for y. 
y-2 


Check by substituting all three values in (a), (b), and (c). 


Exercise 2.27 


Solve each set of equations for the unknowns appearing in them. 


1. bi(5) + b(3) = 25 2. a,(20) + a«(16) + a4(30) = 208 
5,3) + b(2) = 16 4(16) + a.(10) + a,(10) = 102 
4((30) + a.(10) + a3(8) = 122 


Answers to Exercise 2.27 


1. by = 236, = 5 2. a = 23a, =3; a, =4 


Exercise 2.28 


Solve each set of equations for the un 
1. (16) + ¢,(24) = 520 
€(10) + ¢.(4) = 160 


knowns appearing in them. 
2. a(10) + b(12) + c(15) = 116 
a(12) + b(8) + c(20) = 128 
a(15) + (20) + c(16) = 154 


Functions and Graphs 


A function is a specification of the relation between two or more sets of 
numbers. The relation is usually expressed in terms of the dependence of 
one set on the other(s). When a letter symbol represents any number in a 
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set of numbers, the letter symbol is said to be a variable. There are two 
principal ways of incorporating variables in the writing of functions. In the 
one way, the number represented by an algebraic expression is shown to 
depend on the number or numbers represented by one or more variables. 
The number represented by the expression is called the va/ue of the function. 
The variable on which the value of the function depends is called the 
argument. Two examples of this first way of writing functions are given below. 
1. The expression, 


(3X + 2), 


is a “ function of X "; that is, the number represented by the expression 
depends on the number represented by X. The number represented by 
(3X + 2) is the value of the function; X is the argument. 


2. The expression, 
(X: + Xs + X3), 
is a function of X,, Xs, and X5. The number represented by (X, + Xs + X3), 
is the value of the function. There are three arguments: X,, Xs, and Xa. 
The other principal way of writing a function is to employ an equality 
statement and specify the relation between the number represented by one 
variable and the number or numbers represented by one or more other 
variables, If y is said to depend on x, then y is the value of the function and 
x is the argument. Two examples are given below. 
1. The variable y is expressed as a function of the variable x by the equality 
statement: 
y = 3x? + 2x + 5. 


Here y is the value of the function and x is the argument. 
2. The variable M, is expressed as a function of the variables Xi, Xs, Xa, 
and X, by the equality statement: 
M, LEX E XS X 
4 
M, is the value of the function; X,, X» Xs, and X, are the arguments. 

Recall that a literal number symbol sometimes refers to a particular 
number of unspecified value rather than to any number in a set. Such literal 
symbols are said to be constants. Numbers themselves are, of course, 
constants. 

A function involving only one argument lends itself quite readily to 
graphical representation. Let us suppose that we have a function involving 
X and y, where x is the argument and y is the value of the function. For a 
graphical representation we employ two intersecting straight lines, a hori- 


zontal line for the scale of x and a vertical line for the scale of y. The 
horizontal line is called the x-axis. The vertical line is called the y-axis. The 
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lines form a right angle at their intersection. The point of intersection is 
called the origin. The two lines are called the coordinate axes. 

Each scale is marked off in equal intervals and numbered from zero at the 
origin. The x-axis is numbered from the origin to the right with positive 
numbers and to the left with negative numbers. The y-axis is numbered from 
the origin upward with positive numbers and downward with negative 
numbers. 

Given an equality statement showing the relation between y and x, a value 
of y can be computed for any value assigned to x. These two values constitute 
a pair which can be represented as a point located in the two-dimensional 
space in which the coordinate axes are located. The two values are called 
the coordinates of that point. The point is located by the intersection of two 
lines, one drawn perpendicular to each axis at a point on the scale correspond- 
ing to the value of that coordinate. The coordinate which locates the line 
drawn perpendicular to the horizontal axis is called the abscissa. The co- 
ordinate which locates the line drawn perpendicular to the vertical axis is 
called the ordinate. The coordinates of a point are written (x, y) with the 
abscissa first and the ordinate second. 


For a given function we might locate a number of points and draw a 
continuous line which would indicate the locus of points whose coordinates 
would satisfy the equation. 


The linear function 
Y —0.5X +1 


yields a straight line when it is plotted. This line intersects the vertical axis 
at the point (0, 1), in which case we say that the Y intercept is one. The X 
intercept, at the point (—2, 0) on the horizontal axis, is —2. The coefficient 
of X, 4-0.5, indicates the slope of the line, the distance the line moves vertically 
per unit change horizontally. 


The function given above, Y = 0.5X + l, can be plotted by assigning 
several values to X, computing the corresponding values of Y, and drawing 
a line through the points located by these several pairs of values. Listed 
below are three arbitrarily chosen values of X and the corresponding values 
of Y. These three pairs of values have been employed, in Fig. 2.1, to locate 
three points through which a straight line has been drawn. 


X Y 
=3 —0.5 

0 1 

2 2 


, The following quadratic equation is an example of a function whose plot 
is a curved line: 


Y —2X* — 4X 4-4, 


ae ERR 
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Fig. 2.1 Plot of the linear function, Y = 0.5X + 1. 


Listed below are paired values of X and Y. These paired values have been 
plotted in Fig. 2.2. A smooth curve has been drawn through the points. 


x Y 
—4 52 
=3 34 
E 20 
=] 10 

0 4 

1 2 

2 4 

3 10 

4 20 

5 34 

6 52 

Y 
re 
I 
1-32 


pop 
i 
e 
n 


-64 
Fig. 2.2 Plot of the quadratic function, Y = 2X? — 4X + 4. 
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The Y intercept of the function in Fig. 2.2 is 4. Note that the slope of this 
curve varies, depending on X. The slope can be shown graphically by drawing 
a line tangent to the curve at any point. A tangent has been drawn at point 
P (4, 20) in Fig. 2.3. The slope of the tangent at point P is the slope of the 
curve at that point. Note that a tangent at any other point would have a 
different slope. 


P (4, 20) 


Ecko c zi zi x 


41474344 +5 p 


m 


Fig. 23 A straight line has been drawn tangent to the curve of Fig. 2.2 at point 
P (4, 20. The function is Y= 2X? — 4X + 4. The slope of the tangent, 4X — 4, at the 
point P is equal to 12. 


The slope can be expressed as a function of X. It is determined by 
differentiating the expression 2X? — 4X + 4 with respect to X.? The deriva- 
tive—that is, the slope—is equal to (4X — 4). Notice that the slope is negative 
when X < 1 and positive when X > 1. Notice also that the slope is zero 
when X — 1. In this particular example, the minimum value of the function 
occurs at the point where the slope is equal to zero. That is, the lowest point 
on the curve is located where X — 1. An important method of statistics 
involves the determination of the number represented by a literal symbol, a 
number which would give the minimum value of the expression in which the 
symbol occurs. The minimizing value can generally be obtained by a pro- 
cedure commonly used in calculus: finding the derivative, setting the deriva- 
tive equal to zero, and solving for the quantity represented by the symbol. 

A simpler, algebraic method of determining the minimum value of Y and 
the corresponding value of X is described below. Given the equation, 

Y — 2X? — 4X +4, 
we can write it in the form 
4X? —8X +8 
2 


obtained by multiplying and dividing the right-hand side of the equation by 
the coefficient of X?. The numerator on the right can be rewritten as follows: 


Y= 


"Differentiation is a major topic of calculus and is beyond the scope of the present 
review. 
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(4X? — 8X 4-4) - 4 
a T$ 


F 
or 
— 22 
p "X73 15, 


Since (2X — 2)? cannot be negative and must be equal to or greater than 
zero, we observe that Y will have its minimum value when 


(2X —2? — 0, 
in which case 
(2X —2) = 0, 
2X = 2, 
and 
X = I. 


The minimum value of Y can be determined by substituting this value of X 


in the original equation, 
Y — 2X? — 4X + 4. 


Substituting, we find that 
Y = Xy —4(1) +4 


or 
Y=2. 


Functions of Angles 


Imagine a set of coordinate axes and a line which coincides with the 
Positive segment of the X axis and which, like the hand of a clock, is free to 
rotate clockwise or counterclockwise. In Fig. 2.4, the line OQ is shown 


1 +2 43 +4 +5 +6 


Fig. 2.4 The angle @ generated by the counterclockwise rotation of the line OQ. 
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rotated counterclockwise from its initial position to the terminal position 
indicated. 

In trigonometry, an angle is the amount of rotation required to move a 
line from its initial position to a terminal position. The amount of rotation 
can be measured in terms of degrees, minutes, and seconds. 

If we let O be the point of origin, P be any point on a rotated line, and MP 
be a perpendicular erected on the horizontal axis at point M and projected 
through P, we can identify a triangle, OPM. An example of a triangle deter- 
mined in this fashion is shown in Fig. 2.5. We can also identify three values 
corresponding to the three sides of the triangle. OP is the length of the 
hypotenuse of the right triangle; OM is the distance of M from the origin, 
one of the coordinates of the point P; MP is the distance of P from the 
horizontal axis, the other coordinate of P. 


— o ot i—i + 
-6 -5 -4 -3 -2 -1 O| +1 +2 +3 +4 +5 16 


(ls 


tis 


1-6 


Fig 2.5 Determination of the right triangle OPM. P is an arbitrarily chosen point 
on line OQ. The line from M to P is perpendicular to the X axis. 


For convenience, let us use r for OP, X for OM, and Y for MP, and @ for 
the amount of rotation. Functions of the angle 0 are defined by certain ratios 
involving r, X and Y. We shall consider here only three of these ratios: the 
sine, the cosine, and the tangent of the angle 6. 

Sin 0 is read “ the sine of the angle theta ” or simply “ sine theta ” and is 
defined as the ratio of Y to r. That is, 


Y 
sin 9 = —- 
r 


Cos 0 is read “ the cosine of the angle theta " or “ cosine theta " and is 
defined as the ratio of X to r. That is, 


cos 0 = — 
r 
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Tan Ó is read “ the tangent of the angle theta ” or “ tangent theta " and is 
the ratio of Y to X. Thus 


T 
tan 0 =y 


Values of sin 0, cos 8, and tan 9, as well as the other trigonometric functions 
not given above, depend on the amount of rotation of the line and not on 
the distance OP = r which locates the point P on the line OQ. In other words, 
for any given amount of rotation, the sine, cosine, and tangent are fixed 
values. 

The student should try to integrate these ideas about the functions of 
angles with his ideas about functions in general. A trigonometric function 
involves two variables: the value of the function (a ratio) and the argument 
(the angular amount of rotation). Values of these functions are widely 
available in tabular form. See, for example, Hodgman (1, pp. 87-110) and 
Walker (3, pp. 330-331). 
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THE SIGN OF SUMMATION 


Syntactics | 3 


In psychological statistics, the Greek capital letter ** sigma," formed 2, is 
not a literal number symbol, that is, it does not represent a number. It is an 
operator and serves a very useful purpose, that of indicating the arithmetic 
operation of summing. If it were necessary to write every term in the sums 
with which one works in statistics, the symbolism would be excessively 
burdensome. Because the operation of summing can be indicated con- 
veniently and efficiently by a single letter, statistical symbolism can be 
condensed and one’s facility with it is increased immeasurably. 

In this introductory discussion of the sign of summation, we shall let X 
represent any number in one set, there being N numbers in that set. We shall 
let Y represent any number in a second set and Z any number in a third set, 
each of these sets also having N numbers. We require that X,, Y;, and Z, 
be classified together, that Xj, Y», and Z, be classified together, and so on, 
and that there are, in all, N such triplets, the members of each triplet bearing 
identical subscripts. 

The classification of the 3N numbers represented by X, Y, and Z is shown 
in Table 3.1. Columns of the table correspond to the letters Y, Y, and Z. 


Table 3.1 
A CLASSIFICATION OF NUMBERS 
Columns 
Xx Y Z 
1 X; Yi Z 


2 Xa Yi Zi 
3 X; Y; Z; 
Rows 


56 
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Rows of the table correspond to the triplets and can be identified by the 
common subscript of the three letters representing the numbers in a particular 
row. The table is rectangular and possesses as many columns as there are 
sets of numbers and as many rows as there are numbers in each set. 

In the discussion that follows, the letters a, b, and c are constants, that is, 
particular numbers of unspecified value. 

Here are the definitions of some fundamental expressions in which “ a 
occurs. 

l. OX — X, 4- Xy + Xa+. + Xn 

ZYX is read “ sum of X." 

The operation is equivalent to summing the numbers in the first column 
of Table 3.1. 

2. OY e Y, de Vege Yg doses b 

ZY is read “ sum of Y." 

The operation is the summing of numbers in the second column of 
Table 3.1. 

8 E a Zhe Le tone + Se 

XZ is read “ sum of Z.” 

The summation involves the numbers in t 

Notice that each of the summations given above has N terms. 

4. (EX) = (X, + Xa t Xa + so Xn)? 

(XX)? is read “ the square of the sum of no 

The operation involves adding the N numbers in the first column of 
Table 3.1 and squaring the resulting sum. Similar expressions could be 
written for the square of the sum of Y and the square of the sum of Z. 

5. Zxs2 X24 xb Xie + Xx 

£X? is read “ the sum of squares of X." 

The operation requires computation of the square of each number in the 
first column of Table 3.1, followed by the summation of these squares. 2 Y? 
and ZZ? are defined in a similar fashion. Notice that 

ZX? + (ZXY. 


al to the square of the sum of X, since 


he third column of Table 3.1. 


The sum of squares of X is not equ 
DX = X24 X24 X$ c. + Xi # 

(Xi + Xo + Xa +... + Xn = (ZXY. 
t terms like 2X, X5; these product terms 


The quantity, ( ZX)?, involves produc 
A numerical example is given below. 


do not appear in the quantity, zx. 
2 e+e42t+34+ 4) 
29 # 8l. 


6. XX + Y) 2 (X, + Y) + (GG + Ys) +... + (Xn + Yn) 
Z(X + Y) is read “ the sum of the pair sums of X and Y.” 
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This operation involves two summations. Referred to Table 3.1., it involves 
adding the two values in the first and second columns of each row. Then 
these row or pair sums are added. Notice the pairing of X and Y. 

7. XaX — aX, 4- aX,+aX,4+ ... + aXy 

ZlaX is read “ the sum of products of the constant a and the variable X.” 

To perform this operation on Table 3.1, one would multiply each value in 
the first column by a and then sum the resulting products. 

8. Xa —a--a-ra-... 4 a(N terms a) 

Ža is read “sum of a” and implies the repeated addition of the 
constant. Similar expressions could be written for b and c, and for other 
constants. 

9, IY= XY, bk XqYs-b Wee nn. F Rv Fv 

2XY is read “ the sum of products of variables X and Y." 

Referred to Table 3.1, this set of operations involves multiplying the values 
in the first and second columns of each row and then summing the resulting 
products. Notice the pairing of X and Y. 


Exercise 3.1 


Write, in symbols, the definition of each of the expressions listed below. X and Y 
are variables; a, b, and c are constants. 


1l. EX 2. Z(X + Y) 
3. XaX 4. Zc 

5. ZXY 6. ZY? 

7. ZbcY 8. Zb? 

9. Z(X + Yy 10. Z(X/Y) 
Il. (ZY) 12. ZaX* 


Answers to Exercise 3.1 


2 
N 
- bcY, + bcY; + bcYs +... + bcYn 
eb + b+ b+... + b 
(X, P Y)? + (Xa + Y)? + n + (Xn + Yn)? 
10. XJ Yi + X/Y; + Xu Y, + DE + Xy/Yn 
11. (Yi + Ya + Ys uus + YNE 
12. aXi + aXi + aX$ + ... + aXz 


L Xi Xs Xs Fuss XN 

2. (Xi + Y) + (X: + Y) +... (Xy + Yn) 
3. aX, + aX, + aX, + ... +aXy 
4.c+tet+et+..,. c (N terms c) 

5. Yi + XY, + XY +... + XNYN 

6. Yi + Yit Y+.. +Y 

7 

8 

9. 


Exercise 3.2 


Write, in symbols, the definition of each of the following expressions. Only a, 
b, and c are constants; all other letters are variables. 
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1 
3 
5 
Ts 
9 
l; 


1 


. 2Z 2. XX+ Y 4 Z) 
vA 4. Xxyz 
» 2xt 6. ZabX 
ZX + ZY 8. Zc? 
. X(x — YF 10. Z(XJZ) 
(Zzy 12. Za(X — Y) 


Exercise 3.3 
Translate the following verbal expressions into algebraic expressions involving 


"ym. 


NO CO Mb OON ta d QS P2 — 


. the sum of X 

. the sum of the pair sums of X and Y 
. the sum of products of X and Y 

. the sum of squares of Y 


. the square of the sum of ¥ 
. the product of the sum of X and the sum of Y 


. the square of the sum of products of X and Y 
. the sum of products of a constant, a, and a var 


iable, X 
a, and two variables, X and Y 


the sum of products of a constant, 
ich has the constant value, a 


the sum of N numbers, each of wh 


. the sum of the ratios of Y to X 


the ratio of the sum of Y to the sum of X. 


Answers to Exercise 3.3 


7. 


11. 


=X 2. XX + Y) 
ZXY 4. ZY: 
(ZX) 6, EXZY 
(2XY)? a Fd 
ZaXY . Za 
2(Y/X) 12; SIX 


Exercise 3.4 


Translate the following verb 


“pn 
“a 


. the sum of product 
. the sum of the ratios of the constan 


al expressions into algebraic expressions involving 


. the sum of the N numbers in the Y set 


the sum of the Y-and-Z pair sums 
the sum of the N products of X and Z 


. the sum of the cubes of X 


the square root of the sum of X 
the product of the sum of X and the sum of Z 

the square root of the sum of products of X and Y 

the square of the sum of products of a and X 

the sum of products of the constant a and the variable, the reciprocal of X 


the repeated addition of the constant a 
s of the reciprocals of X and Y 
t a to the variable X. 
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Simplification Rules 


Certain expressions involving “ 2” can be rewritten in forms which are 
more convenient for computation and which may be necessary for other 
reasons to be discussed later. The rewriting of these expressions is called 
simplification. There are four rules which can be applied singly and in various 
combinations to effect these simplifications. The four rules are listed and ex- 
plained below. 


RULE I. D(X + Y) = ZX + LY. 

The sum of the X-and-Y pair sums is equal to the sum of the sum of X 
and the sum of Y. 

Proof: By definition, 
Z(X + Y) = (X, + Ys) + (Xa + Yo) + (Xs + Ya) +... + (Xn + Yn). 


The terms in the right member of the equality statement can be regrouped as 


(X, + Xe+ X HE e + Xv) + (Ki + Ye + Yat... + Yn), 
which expression can then be written as 
AX + XY. 


The rule is based upon the fact that the sum of the values of X and Y is not 
affected by the grouping of those values or the order of adding them. 

Referred to Table 3.1, the expression, Z(X + Y), gives the instruction to 
obtain the sum of the two numbers, X and Y, in each row and then to com- 
bine the N row sums. Referred to the same Table, the expression, XX + LY, 
implies adding the N numbers in the X column, adding the N numbers in the 
Y column, and then adding the two column sums. 

Caution: Rewriting Z(X + Y) as ZX + ZY looks superficially like the 
distribution of multiplication by “ 2 ”, but this interpretation is not correct. 
As the symbol “ Æ ” is employed in this book, it does not represent a number, 
and, taken alone, is never a factor in multiplication. 


RULE I. ZaX = aL x. 

The sum of the products of a constant and a variable is equal to the product 
of the constant and the sum of the variable. 

Proof: By definition, 


Zax = aX, + aX,+aX,+... + aXy. 
We observe that a is a factor common to each term in the right member of 
the equality. We can remove this common factor and write the expression as 
a(X, ^X. Xa + n + Xn), 


which can then be rewritten as 
aAX. 
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RULE I. Za = Na. 

The repeated addition of a constant, when there are N terms in the 
summation, is equal to the product of N and the constant. 

Proof: By definition, 

Za=a+a+a+... +a (N terms a), 

but recall that, in arithmetic, the multiplication, N X a, is defined in the same 
way. That is, N x a is defined as the repeated addition of a, there being N 
terms in the addition. It follows that 


Xa — Na. 


RULE IV. YY cannot be rewritten in any form which simplifies com- 
putation. By definition, 
EXY = X,Y, + XaYs + XsYs +o + Xw¥w 
and, with respect to the multiplication and the addition, there is only one 


correct order of operations, multiplication before addition. 
Caution: The sum of products of two variables is not equal to the product 


of the sums of the two variables. That is, 
ZXY # ZXZY 
or 
XY, + X,Ys +... + Xn Yn # 
(X TX e + OXSXY + Ya tees Yny). 
m of products for two variables does not 


A numerical example in which the su r tW 
ables is given below. 


equal the product of the sums of the two vari 
2x344x546xT73xQ-4- 00-57) 
68 + 180 


Exercise 3.5 

Rewrite the following expressio 
above. In addition to a, 5, and c, 
are to be considered variables. 


ns in simplified form by applying the rules given 
N and numerals are constants. All other letters 


l1 Z- 2. ZbY 
cae s 4. QUY) 

5. Z(Y + a) 6. ZabX 

7. Xac 8. ZXZY 

9. Bax =ð 10. ÉL Y 

11. Eb?N 12. >> 

F Te 
- (a — b) . Em 

17. Z(X/a) A P y 


19. 2x/ ZY 


21. X(a|]Y) 


23. X(a]N) 
25. 23X?Y 
27, 2K — 2) 
29. X £X 


Answers to Exercise 3.5 
1l. 2x — Zy 


Oe 
E 
e 


11. 5*N* 


13. 9X — cZY 
15. N(a — b) 

17. (1/a) ZX 

19. ZY/ XY 

21. a X(1/Y) 

23. d 

25. 3ZX*Y 

27. ZX — 2N 

29. ZX ZX or (ZX)? 
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x+Y 


— a 


. X(X — a[N) 
. 24aY 
uy) 

. Z(a — X) 


bzY 


« IY) 
. ab ZX 
. EXZY 
.cZXY 


x+y¥* 


Z 


.aZXx* 

. ZX? —Q2ZXY + ZY? 
. Na[c 

. ZY* — 2c ZY + Ne 

. (l/ag( 2X + ZY) 


. ZX—a 
. 4a ZY 
Nu 
.Na— ZX 


* X 
px + >z is also correct but would not be considered a simplification of 


computation. 


Exercise 3.6 


Rewrite the following expressions in simplified form by applying the four 
simplification rules. In addition to a, b, c, N, and numerals, Greek letters are 


constants. All other letters are variables. 


1, Z(X + Y 4 2) 
abc 

. Z(a + bX) 
EN? 

. 2(aX — bY) 


I1. Ze 


13. Z(aX + 8) 
15. Z(« + B) 
17. £o) 

19. (ZX ZY) zz 
21. E(a/X) 


23. Z(ejp) 


NM oMb ta t 


ZaX 
ZXYZ 
Za*X? 


. Z2XZYIZ 


Zax? Y 


a 
DEFY 
. ZaX? 

. Z(X + ay 
. Z(ablc) 
LEY = aF 


yr Y+Z 


a 


. Z(XIN) 


Ne À— 
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25. 25 26. 25N 
27. Z(X* — 10) 28. ZX*Y* 
29. (2X)? 30. Z(a — X/a) 


The Summation Sign in Equations 


We have been employing literal number symbols as constants and variables, 
and we have been writing the Greek capital letter “ Z” to indicate the 
operation of summing. To indicate a summation, * X” will always be 
written with one or more other symbols; it will not be written alone. When 
* X is written with another letter, number, or combination of letters and 
numbers, the entire expression represents a number or a quantity which can 
be used in further arithmetic operations or computation. In solving a single 
equation or a system of simultaneous equations, a factor or a term involving 
the summation sign is treated as a literal symbol representing a number. On 
occasion, the solution of an equation for some specified unknown may require 
application of the rules of simplification. 

Given below are some examples of operations performed on equations 
containing one or more summation signs. 

1. The problem: Given M, = ZXIN, solve for EX. 

The solution: Multiply both sides of the equation by N and obtain 


ZX = NM. 
2. The problem: Given S, = \(2x"/N), solve for Zx?. 
The solution: Square both sides of the equation to obtain 
S2 = Xx?N. 
Multiply both sides by N to obtain 
=x? = NS. 


3. The problem: Given 
r,NS,S, = ZXY, 


solve for rgy- 
The solution: Divide both sides of the equation by NS,S, and obtain 


= Zxy | 
Tov = NS,S, 
4. The problem: Given 
y 2x? 
Sz = Mest 


solve for the quantity, N — 1. 
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The solution: Square both sides of the equation to obtain 
s? xx 
= N-I 


Multiply both sides by (N — 1) and then divide both sides by S}? to obtain 
the answer, 


2x 


N—1-2— 
Sz 


5. The problem: Given 
2(X — M) = 0, 


where M, is a constant, solve for M,. 


The solution: Apply the first and third simplification rules and obtain 
the equation, 
ZX — NM, = 0. 


Add NM, to both sides and then divide both sides by N to obtain the solution, 
M: =— 
6. The problem: Given 


where a and b are constants, solve for X X. 


The solution: Apply the first, second, and third simplification rules to 
the expression on the right side to obtain 


ZX = ; (ZX — Na). 


Multiply both sides by b and then add Na to both sides to obtain the solution, 
ZX = bZX' 4+ Na. 
7. The problem: Given the simultaneous equations: 
(1) ax? + bxy = Xxz, 


(2) aXxy--bZy* = Xyz, 
solve for a and b. 


The solution: Divide (1) by Xx? to obtain 


(3) a + bZXxy| £x? = Exz| xt. 
Divide (2) by Zxy to obtain 
(4) a+ bXy!|Exy = Xyz| Exy. 


Subtract (4) from (3) to obtain 
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: (5) b&xy/ Sx? — bXy*|Xxy = Zxz| Ex? — Lyz/ Ixy. 
Remove the common factor, b, on the left side in (5) and write 
b(Zxy|Ex? — Sy?/Lxy) = Lxz/Zx? — Lyz/Zxy. 
Divide both sides by the coefficient of b to obtain the solution, 
_ Axz[Ex? — Xyz|Zxy. 
Axy|Ex* — Ly?/ Exy 
Note that the final solution for b can be written in a different form. If we 
multiply the numerator and the denominator (above and below the bar) by 


Zx? Xxy, we obtain 


" ExyXx: — Zx*Xxyz 
: (Exyy — 2x22? 
By a similar procedure, which will not be presented here, we could solve 
for a. We would find that 

Ayz| Zy? — Zxz|Exy 


Zxy| Ey: — Exi| Exy 


which can be rewritten as 
» AXxyàyz: — Zy Zx, 
a= (my — Ext Ly 
We could compute the values of a and b, by the formulas derived above, if 
we were given the actual numbers represented by the various sums of squares 
and products: Jx?, Zy’, Zxy, ZXZ, and Zyz. 


Limits and Indices 


As the summational notation was described above, the number of terms in 
the summation was understood to be N and the summation was understood 
to involve all terms from the first to the N'h, but other possibilities must be 
taken into account. There is a conventional way of making explicit the 
number of values in the entire set to which the summation applies, and the 
last term in the series of terms. A number is written above the summation 
sign. The number is called the upper limit of the summation. Several examples 
are listed below. 

In the expression, b X, the limit, N, indicates that there are N values of X 
and that the summation ends with the Nt" value. It is understood that the 


summation begins with the first term and includes all N terms. 


n 
In the expression, X Z, the limit, n, indicates there are n values of Z and 
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the summation ends with the nt? value. It is implied that the summation 
begins with the first term and involves all n values of Z. 


c 
In Z Y, the limit, c, indicates there are c values of Y and the summation 
ends with the ct” value. It is understood that the summation begins with the 
first term and includes all c values. 


8 

In X X*, the limit indicates there are eight values of X and, consequently 
eight values of X?. It also indicates that the summation ends with the eighth 
value. It is implied that the summation begins with the first term and includes 
all eight terms. 

In each of the expressions given above, the summation begins with the 
first term but it is possible to begin with some term other than the first. There 
is a conventional way of writing a summation to make explicit where in the 
series of terms it begins. A number is written below the summation sign. It 
is referred to as the /ower limit. The lower limit indicates the number of the 
term with which the summation begins and the summation is then understood 
to include all succeeding terms including the last as indicated by the upper 
limit. Examples of expressions in which both limits are made explicit, are 
given below, with their definitions. 


N 
UX= K+ Mat Nat... Xv 
BX = Net Kat Xt... + X, 


m 
ZX= X,+ Xa t Xpt + se beg 

Since, in our work, the summation will always begin with the first term, 
there will be no need to write the lower limit. On occasion, it will be necessary, 
however, to make the upper limit explicit. Where no limits are written, the 
intended lower limit is 1 and the intended upper limit is either N or a value 
which is clearly given by the context in which the expression appears. 
: There is another feature of summational notation which should be men- 
tioned. A letter can be placed under the summation sign and written at the 
same time as a subscript to one or more variables indicating that the sub- 
Scripted variables are to be summated and that quantities with different 


subscripts or no subscripts at all are to be considered constants. 
Examples: 


N 
In the expression, ZaX; X; is a variable, a is a constant, the lower limit 


. t 
is understood to be 1, and the number of values of the variable is N. The 
symbol, i, is called the index of summation. 


n 
In the expression, Z aX;, X; is the variable in the summation, a is a con- 
¥ 
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stant, a lower limit of 1 is implied, the number of values of the variable is 
n, and j is the index of summation. 


m 

In the expression, Za; Y; Y; is the variable in the summation, a; may vary 
i=l 

but in this summation is a constant, the lower limit is 1 and is made explicit, 


the number of values of the variable is m, and i is the index of summation. 


n : 
In the expression, Z aX; Yi, there are two sets of paired values, X; and Y;. 


i 
The index of summation is i. There is a constant, a. There are n terms in the 


summation. 
When a set of numbers is classified into subsets, the succession of terms 


in a summation can be formed in more than one way and it becomes neces- 
sary to specify the particular group or series of terms intended by the sum- 
mation. This specification is also made possible by the adoption of an index 
of summation for each way of summing. In a later chapter dealing with the 
analysis of variability for classifications of numbers, limits and indices will be 
discussed more fully and it will be shown how they considerably extend the 
range of application of summational notation. 
When there is no need for limits and indices, we shall omit them. 


MOMENTS 
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Moments constitute a class of values which may be computed for any set 
of numbers by the application of a general formula. To compute a moment 
for a set of N numbers, one raises each number in the set to a specified power, 
sums these powers, and divides this sum by N. Particular moments are 
identified and named by reference to the power to which the numbers are 
raised, that is, by reference to the exponent employed. Thus the mt" moment 
in the class of moments is defined as follows: 


The m moment of a set of N numbers is the sum of the m powers of the 
N numbers, divided by N. 


Moments of X 


In summational notation, the mt" moment of Y is written 
Zxr 
N 


the first moment of X is written 


the second moment is 


the third moment is 


and the fourth moment is 
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We could, of course, continue indefinitely in the writing of higher moments. 
Notice that the zeroth moment is 
zx? 


which can be written 


THE MEAN 

The first moment of Y is called the mean of X. We shall employ M, as the 
symbol representing the mean. (X is employed by many authors for this 
purpose.) Thus 
zx 
N 
M, is computed as indicated by formula (4.1), that is, by summing the N 


numbers in a set and dividing the sum by N. 
Example: Given the numbers: 4, 4, 6, 6, 6, and 10, we let X represent any 


value in the set and N, the number of values in the set. Then 


M: = (4.1) 


AX = 36 
and 
ZX 36 6 
key e 


Moments of x 


We propose transforming X to x by the formula: 

x= X — Mx 
Y refers to any number in the original set; M, is the mean or 
s; x is any number in the new set which 
viation from the mean. 


In the formula, 
first moment of the original values; - 
results from the transformation and is called a de 


Consider now the first four moments of x. In summational notation, the 


first moment of x is written 
Ax 
= 


the second moment of x is 
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the third moment is 


Zx 
x 
and the fourth moment is 
Dd 
N 
Notice that the zeroth moment of x is 
Zx? 
N 
which can be written 
PONE MN 
N N° 
Notice also that the first moment of x, 
zx 
N 
can be written 
A(X — M,) 
N 


an expression which can be simplified, as follows: 


ZX — ZM, ZX — NM. EX NM: iy Mic 


N N N N 
THE VARIANCE 


The second moment of x is called the variance. We shall employ V, as the 
symbol for the variance. AV, is called the standard deviation and will be 
represented symbolically as S,. It is evident that V, = 8t 

Example: Given the numbers: 4, 4, 6, 6, 6, and 10, we found the mean to 
be 6. Transforming the original numbers by the formula, 

x= X — M,, 
yields a new set which consists of —2, —2, 0, 0, 0, and 4. Then 


Zx? = (—2* + (—2)? + 0* + 0? + 08 + 42 = 24 
and 


Note that S, = JV, = J4 = 2. 

The formula used above, 
xx 
ON 


is not generally convenient for computing. We can rewrite it and apply the 


L^ 


, 


—— À— 


Moments of Zy vi 
rules of simplification to obtain a computing formula which is especially well 


suited to machine computation. 


The expression, 
Ax 
N 


can be written as 
X(X — My 
CUM 


It should be evident that, if we obtain a computing formula for the numerator, 
2(X — M,)*, which is the sum of squares of deviations from the mean, the 


final computation of V, will require only division by N. 

Since M, is a constant for any given set of numbers, simplification of the 
numerator proceeds, as follows: 

ZX(Y — Ma} = 3(X? — 2M,X + M3) = EX? — 2M,ZX + NM? 
We can now substitute 2X/N for M, and obtain 
IE (2x) " (Zxy , xy 
ZX? —25- 2K +N NS = 2X A ae 

Combining the second and third terms on the right yields a computing 
formula for Z(X — M+}, 


(ZX? 

-= 4.2 

y (4.2) 
Keep in mind that formula (4.2) gives Zx? which must be divided by N 

to obtain V,. That is, 


AX? 


(Zxy 

mi eqs 
N N 

To compute the standard deviation, S,, one finds the square root of the 
variance, Vy 

Example: For the numbers: 4, 4, 6, 6, 6, and 10, ZX = 36, EX? = 240, 
and N — 6. Substituting these values in formula (4.2) yields 

36? 


20 — = 24. 


Vz 


Then V, = 24/6 = 4 and S, = 2. 


Moments of Zx 


Consider next a transformation of x to Z, by the formula, 
x x 


Za E S. 
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In mathematical statistics, Z, is called a standardized variable. In psycho- 
logical statistics, Z, is usually called a standard score or simply a “ z-score.” 
The first moment of Z, is written 


ZZ. 
WU 
The second moment of Z, is 
222 
a 
The third moment is 
Pel 
- 
The fourth moment is 
Pared 
9" 
Notice that the zeroth moment of Z,, 
ZZ? 
NU 
can be written 
A) N 
"WCNWT 
and that the first moment of zs 
ZZ. 
NO 


can be written 


Z(x[$)  (/S)Zx — (1/S,)(0) á 
UI 


N N 
Notice also that the second moment of s 
zz 
N” 


can be written 


ZIS _ Z(x*/S2)  (ushms 1 
N N CUM et 


THEINDEX OF SKEWNESS 


The third moment of Z, is called the index of skewness. Itwill be represented 
here by the symbol, S.! (The symbol, £,, was suggested by Karl Pearson and 
is widely employed.) 


"The student should not confuse S, the index of skewness, with S,, the standard deviation 
of the variable X. The subscript, x, distinguishes the standard deviation. 
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Example: Given the numbers 
4, 4, 6, 6, 6, and 10, 


for which M, = 6, V, = 4, and S, = 2, the numbers can be transformed to 


the deviations: 
—2, —2, 0, 0, 0, and 4. 


The deviations can be transformed to standard scores by the formula, 


Z= 
The standard scores produced by the transformation are 
—1, —1, 0, 0, 0, and 2. 


Then 
EZ? = (—1)8 + (—1} + 0? + 0° +07 + 2 = 6 
and 
223 6 
uim des! im 


The formula for S employed above is seldom convenient for computing 
because the standard scores are usually troublesome to obtain. A computing 
formula can be derived by substituting 

(X — Mz) 
S. 
for Z, in ZZ3/N producing 


The numerator can be rewritten as 
X(X? — 3X*M, + 3XM? — M3), 


and rewritten again as 
EX? —3M,ZX? + 3MiZX — NM}, 
and finally as 
EX? —3M,£EX? + 3NM3 — NM$ = SX? — 3M,ZX?* + 2NM$. 


The formula for the index of skewness now becomes 
8 — 2 3 
"A ZX E T 2NMi (4.3) 
NS? 

Although formula (4.3) is more convenient for computing the index of 
skewness than is the defining formula, S = 2Z3/N, it still represents a 
forbidding task on a desk calculator even when N is only moderately large. 
Fortunately, programs are now available for computing the higher moments 
on electronic machines, which can do the task with speed and accuracy. 
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THE INDEX OF KURTOSIS 


The fourth moment of Z, is called the index of kurtosis. It will be 
represented by the letter, K. (Karl Pearson used f, for this value.) 
An example: The numbers 


4, 4, 6, 6, 6, and 10, 


were transformed to standard scores in the preceding section. The standard 
scores which were produced by the transformation are 


—1, —1, 0, 0, 0, and 2. 


Then 
ZZi —(—D* + (—DY + 0* + Of + OF + 24 = 18 
and 
ZZ: 18 
uix zx Rods 


As is true of the index of skewness, the index of kurtosis is not, in general, 
easy to compute. The formula employed above can be rewritten, as follows: 


ZZi  X(S,* Eosi) | 20 _ N(X— M, 
N N N NS N 
The simplification rules can then be applied to the numerator to produce 
2(X* — 4X3M, + 6X*?M2 — 4XM3 + M3) 
which is equal to 
2X* — AM,EX? + 6M2EX? — AM3ZX + NM. 


After the last two terms are combined, the final computing formula for the 
index of kurtosis is 


K 


EX! — 4M,EX* + 6M3ZX* — 3NM1 
NY: . (4.4) 


Computation of K is possible on an automatic desk calculator but can be 
done with much greater speed and accuracy on an electronic computer. 


K 


The Descriptive Moments 


The student has probably noticed that we have paid special attention to 
the first moment of X, the second moment of x, the third moment of Z,, and 
the fourth moment of Z,. For reasons which will become clear later on, we 
shall refer to these four moments as the descriptive moments. It will be 
profitable to examine certain logical properties of the four descriptive 
moments. The logical properties of these moments with reference to a set of 
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numbers will be considered first. Their properties with reference to a fre- 
quency distribution of those numbers will be considered second. At this 
point in our discussion, those logical properties having to do with error will 
be disregarded. Later on, in the discussion of random variation, logical 
properties having to do with error will be considered. 


PROPERTIES OF M, 
Listed below are five logical properties of the first moment, or mean, of X. 
1. Inspection of the formula, 


2X 
My =" 
and the equation, 
EX = NM, 


which is easily derived from the formula, reveals that the mean of a set of 
numbers is the constant value which results when the sum of the set is 


divided into N equal quantities. 
2. The formula, 


ZX 
Mz = yy" 
can be rewritten as 
i ee eee dle 
Mz = Xi ty Xe ty Xa + Ty An 


If we let p = 1/N and substitute, the preceding statement becomes 
M, = pX, + PX: + PX; +... + pXn 


and can be interpreted, as follows: The mean is a number constituted by 
combining equal proportions of the numbers in the set. Because the mean has 
this property, one might think of it as a representative value. (It should be 
noted that p is a constant and that it is the proportion contributed to the 
value of the mean by each number in the set.) “ Representative,” as used 
here, does not mean typical or most frequently occurring. 


3. Given the numbers: 
X; Xo, Xs. T Xv, 


where Y, < Xy and there is no lower algebraic value than X, and no higher 
algebraic value than Xw, then it can be shown that 
X, < M: < Xn. 


When the numbers in a set vary in value, the mean will lie between the lowest 
value and the highest. That is, the mean can be said to be an interjacent value. 


(A proof will not be given here.) 
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Caution: The mean will not necessarily lie midway between the two 
extreme values on the scale of X and will not necessarily lie in the’ middle 
of the distribution of values, with as many values above it as below. 

4. We observe that 


for the reason that 
A(X — M) = 0. 
That is, we showed that 


Z(Y — Mj) = ZX — IM, = EX — NM, = EX — XX =0. 


Thus we can say that the sum of deviations from the mean is zero, which 
statement implies that the absolute value of the sum of negative deviations 
is equal to the absolute value of the sum of positive deviations. 

Caution: The fact that the sum of deviations from the mean is zero does 
not imply that for every positive deviation there is necessarily a corresponding 
negative deviation of equal absolute value, although such correspondence is 
possible. 

5. It can be shown that 


A(X — M,)? < X(X — ay 


when a # M,,. In words, we can state that the sum of squares of deviations 
from the mean is less than the sum of squares of deviations from any other 
reference value. That the mean has this property can be proved, as follows: 
Consider the expression, 


A(X — c)*; 


it can be viewed as being a function of c. We may ask what value c must have 
for the value of the function to be a minimum for any set of numbers. The 
expression can be simplified and rewritten as indicated below. 


IX — c = XX? — 2cX 4 ct) = EX? — 2cZX + Ne? 
= Ne -—2cXX + Xx? 
N&c? — NEX + NEX? 
N 
_ (Ne — EX) — (ZX} + NZX? 
N 


from which we see that, if N, XX, and ZX? are treated as constants, which 
they would be for any given set of numbers, then the value of the function 
depends only on c. 

We know that 


(Nc — EXP 20 


and that the value of the function will be a minimum when 
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(Nc — ZX} = 0, 
in which case 


Ne — 2X =0 
and 
-ZX OM 
c= a = Ms 
Therefore, the value of the function, 
AX — cy 
E 


is a minimum only when c = M+. 
Summary of the properties of the mean: 
1. The mean is a value which results when the sum of a set of numbers is 


divided into N equal parts. 
2. The mean is a representative value, constituted by combining equal 
proportions of all N numbers in a set. (It is not necessarily a typical value or 


the most-frequently-occurring value.) 
3. The mean is an interjacent value, with respect to the extreme values in 


a set of numbers. 
4. The sum of deviations from the mean is zero. 
5. The sum of squares of deviations from the mean is smaller than the 


sum of squares of deviations from any other reference value. 


PROPERTIES OF V. 


Two properties of the second moment of x, or the variance, are presented 


next. 
1. Let (X, — X) be the difference between a particular number and any 


number in the set including itself. The mean of these N differences, 


2%, — x), 
N 
is equal to 


ZX,— 2X NX, 2X 
N ON N 
Thus a particular deviation, x,, can be said to be a function of the differences 
between X, and the N numbers in the set. That is, 
2X, — X) 
d EM 
N 
Now the variance is a function of deviations from the mean as indicated by 
the formula, 


= Ai M; Xy 


2e 
mE n 
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Since each deviation is a function of a set of differences and the variance is a 

function of all of the deviations, the variance can be said to be a function of 

the differences among the numbers themselves. That is, the magnitude of the 

variance depends on the magnitude of the differences among the numbers. 
2. Recall that the value of the function, 


A(X — oy, 


is a minimum for any set of numbers when c = M. It follows that the second 
moment of (X — c) is a minimum when c = M,, that is, when the second 
moment is 


N N 


which is the variance as we have defined it. Thus the variance is the minimum 
value of the second moment of (X — c) where c is a constant that is to be 
subtracted from each value of X. 


XX- My ze 


PROPERTIES OF S 


The logical properties of the index of skewness will be discussed next. The 
index of skewness has been defined as 


245, 
CUN 


Note that S is a pure or abstract number in that it is not expressed in the 


units of the original scale of X, as are M, and V,. The units of the original 
scale cancel in 


S 


E. 


and the units of the new scale of Z, are actually standard deviations. Thus S 
is said to be dimensionless. 


Recall that Zx/N = 0. That is, the first moment of x, an odd moment, is 
always equal to zero, in which case the absolute value of the sum of negative 
deviations is equal to the absolute value of the sum of positive deviations. 
Recall also that ZZ,/N = 0. The first moment of Z, is always zero, in which 
case the absolute value of the sum of negative values of Z, is equal to the 
absolute value of the sum of positive values. With neither of these first 
moments, however, does the cancelling imply that there is a positive number 
of absolute value equal to the absolute value of each negative number, 
although that situation could occur. 

Now S, the third moment of Z,, is also an odd moment. When values of 
Z, are positive, the corresponding values of Zł will be positive. When values 
of Z, are negative, the corresponding values of Z2 will be negative. Thus it 
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could happen that positive cubes would cancel negative cubes completely in 
EZ, resulting in a value of S equal to zero. This cancelling will not neces- 
sarily take place in every case as it does with Zx/N and 2Z,/N. 

One situation in which the cancelling is complete and S is equal to zero 
is that in which there is a positive Z, of absolute value equal to the absolute 
value of each negative Z,. There are other situations in which this correspond- 
ence does not obtain but the cancelling is complete and S is equal to zero. 


When the absolute value of the sum of the positive values of Zł exceeds 
the absolute value of the sum of the negative values of Z$, then S is positive. 
That is, S > 0. When the absolute value of the sum of positive values of Z 
is less than the absolute value of the sum of negative values of Z3, then S 
is negative. That is, S < 0. There is, in general, no upper or lower limit on 
the value of S. 


PROPERTIES OF K 


The logical properties of the index of kurtosis will be considered next. The 
index of kurtosis has been defined as 


ar 
(0N 


As is true of S, K is a dimensionless value, a pure or abstract number. It can 
be shown that 
KzsS?*-41. 


It should be apparent that the lowest value, K — 1, can occur only when 
S — 0. K has no upper limit. It can also be shown that 
K=6& +1 
only when there are two distinct values of Z, and that 
K=1 
(which means that 2Z3/N = 2Z?2/N = 1) only when the two distinct values 


of Z, are +1 and —1, in which case the two values occur with equal 


frequency in the set of numbers. 
Given a set of numbers in which there is a positive Z, of absolute value 
equal to the absolute value of each negative Z,, in which case S = 0, 


K>1 


if the number of values of Z, greater than +1 and the number of values of 
Z, less than —1 are, together, less than the number of values of Z, between 


—] and +1. 
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Frequency Distributions 


The numbers in a given set can be classified according to their values and 
the results of this classifying can be conveniently summarized in a frequency 
distribution which represents frequency as a function of the magnitude of X. 

Consider a set of N numbers in which the distinct or different values are 


LO n 


and f, is the number of Xj's, f; is the number of X,’s, and so on. It is 
customary to say, in general, that f is the “ frequency of X." Note that 


Ac fcfc. +f, = 2f = N. 
The frequency distribution of X is shown in Table 4.1. 


Table 4.1 
FREQUENCY DISTRIBUTION FOR A SET OF N NUMBERS 
x f 
X Ai 
X; f. 
X; fs 
X, Sf, 


r = number of classes. 
f = frequency of X in each class. 
Zf—N. 


Notice that the notation has been altered. Earlier, each subscript identified 
one number in the set. Here, in the altered notation, each subscript identifies 
a class of numbers of the same value. A class may contain more than one 
number, or it may contain only one number, or it may not contain any, 
being empty or vacant. 

One possible graphical representation of a frequency distribution is called 
a histogram. In a histogram, frequency is represented on the vertical axis and 
the scale of X is placed on the horizontal axis. A rectangular column is 
erected over each value of X with a height determined by the frequency in 
that class. With equal score intervals, the only case we shall consider, the 
area under the entire histogram is directly proportional to N, the total 
number of values in the set. 

Table 4.2 presents a frequency distribution for a set of N = 48 numbers 
which range in value from 1 to 5. Figure 4.1 is a histogram which represents 
in graphical form the frequency distribution of Table 4.2. 
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Table 4.2 
A FREQUENCY DISTRIBUTION 
x f 
1 1 
2 18 
3 12 
4 14 
5 3 
N — Ef— 48. 
30 
25 
20 
fis 
10 
5 
0 i's a * 85 6 
x 


Fig. 4.1 A histogram representing the frequency distribution of Table 4.2. 


Frequency Distributions and the Descriptive Moments 


Formulas for the four descriptive moments can be written to take advantage 
of the classification provided by a frequency distribution. The sum of the 
N numbers in a frequency distribution is 

LiX + feX2 + faXa + 0s fX 
Therefore, the formula for the mean can be written 
LÉm. 
= N 


The sum of squares of deviations for the N numbers is 


fed fed feces fx 


and the variance can be written as 


Spe 


Ve = 


82 Moments 


This principle can be extended to the index of skewness to yield 


Zfz3 
me N 
and to the index of kurtosis to yield 
Xfzi 
K= = 


LOCATION OF A DISTRIBUTION 


With respect to a frequency distribution or its histogram, the mean is a 
measure of location. If we fixed or held constant all other moments of a 
distribution but allowed the mean to vary, the size and the form of the distri- 
bution would not change but the histogram would move to the right as the 
mean increased in value and to the left when the mean decreased in value. 
Therefore, we say that the mean indicates the location of the distribution. 

Figure 4.2 shows two histograms based on distributions with equal total 
frequencies, variances, indices of skewness, and indices of kurtosis. Only the 
means of the two distributions differ, a circumstance which places the two 
histograms in different positions on the scale. 


Fig. 4.2 Two histograms based on distributions with unequal means, but equal total 
frequencies, variances, indices of skewness and indices of kurtosis. The histogram drawn 
with dotted lines represents the distribution with the larger mean. 


DISPERSION OF A DISTRIBUTION 


The variance is said to be a measure of dispersion, spread, or scatter. If we 
held all moments constant except the variance, but allowed it to vary, the 
dispersion of the frequencies along the scale would increase as the variance 
increased and decrease as the variance decreased. Figure 4.3 shows two 
histograms with equal total frequencies, means, indices of skewness, and 
indices of kurtosis. The variances of the two distributions differ. Histogram A 
has a larger variance than histogram B. 
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Fig. 4.3 Two histograms based on distributions with unequal variances, but equal total 
frequencies, means, indices of skewness, and indices of kurtosis. Histogram A has a larger 


variance than histogram B. 


SKEWNESS OF A DISTRIBUTION 


S has been called an index of skewness. Skewness refers to the symmetry 
or asymmetry of a distribution or its histogram. A distribution which is 
symmetrical about the mean is called a symmetrical distribution. This implies 
that the histogram is to be divided into two parts by erecting a perpendicular 
at a point on the horizontal axis corresponding to the value of the mean. In 
a symmetrical figure, for any point in the part of the histogram above the 
mean, there is a corresponding point in the part below the mean. These 
corresponding points have the same ordinate and the absolute values of 
their abscissas are equal when the X's have been transformed to deviations 
from the mean. 

When the distribution is symmetrical, S — 0. When S Æ 0, a distribution 
is asymmetrical. When S < 0, we say the distribution is negatively skew. 
When S > 0, we say the distribution is positively skew. 

In Figure 4.4 are two histograms for distributions with equal total 


i 4 + —t 
x x 


Fig. 4.4 Histograms representing two distributions with equal total frequencies, means, 
variances, and indices of kurtosis, but unequal indices of skewness. The index of skewness 


for histogram A is 0.25; for B, it is 0.50. 


frequencies, equal means, equal variances, and equal indices of kurtosis, but 


with unequal positive indices of skewness. 
Caution: We have said above that, when a distribution is symmetrical, 
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S — 0, and that, when S + 0, a distribution is asymmetrical. One cannot say, 
in general, that, when S = 0, a distribution is symmetrical. One can say that 
when all the odd moments of Z, are zero, the distribution is symmetrical, but 
S, the third moment of Z,, can be zero while one or more of the higher odd 
moments are not zero. Figure 4.5 presents two histograms with equal total 


E ——H——3À— —À——— 

x x 
Fig. 4.5 Histograms for distributions with equal total frequencies, means, variances, 
indices of skewness, and indices of kurtosis. S = 0 and K = 2.6 for both distributions. 


Histogram A is symmetrical, but B is not. They differ with respect to their fifth moments. 
The fifth moment of Z, for A is zero; for B, it is 0.50. 


frequencies, equal means, equal variances, and equal indices of kurtosis, and 
with indices of skewness equal to zero. Histogram A is symmetrical; B is not. 


KURTOSIS OF A DISTRIBUTION 


The index of kurtosis is said to reflect the flatness or peakedness of a 
histogram, but the appropriateness of these terms is not readily apparent in 
the case of distributions with extreme values of K. The index is also said to 
reflect the concentration of frequencies at or near the center of the distribu- 
tion, relative to the concentration in the tails. Figure 4.6 shows a discrete 


=2 -1.5 -1-05 0 405 41415 +2 
Z 


x 


Fig. 4.6 A frequency distribution of Z, for which S = 0 and K = 1. 


distribution of Z, for which K = 1, the lower limit of the index. Notice the 
equal concentrations of frequencies at —1 and +1. In Figure 4.7 are two 
histograms which differ only with respect to kurtosis. Notice the difference 
between the two histograms with respect to the relative concentrations of 
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A B 
t f 
4. — M——r- — t + p 
x x 


Fig. 4.7 Histograms for two distributions which differ only with respect to their indices 
of kurtosis. K, for histogram A, is 1.5; the same moment for B is 3.5. For both histograms 


S is zero. 


frequencies in the middle interval and in the intervals at the extremes. 

We have seen how moments can be computed from a frequency distribu- 
tion. It is possible to reverse this process. There is a determining relation 
between the moments of a distribution and the frequencies in that distribu- 
tion. It is in the nature of the relation that one can choose values for the 
moments and then compute the relative frequencies determined by the chosen 
values. (The computational procedure, which involves the solution of a set 
of simultaneous equations, will not be given here.) The number of moments 
required to fix the distribution depends on the number of distinct intervals 
in the range of the entire distribution. The zeroth moment can be counted as 
one of the required number. For example, five moments—the zeroth, first, 
second, third, and fourth—would be required to determine the frequency 
distribution of a set of numbers in which there occurred the five different 
values: 1, 2, 3, 4, and 5. The frequency distribution of a set of numbers 
whose range covered six intervals on the scale of X is not fixed by five 
moments. In Figure 4.5 are two six-interval histograms with equal total 
frequencies, means, variances, indices of skewness, and indices of kurtosis. 
The two histograms differ with respect to their fifth moments. Histogram A 
has a fifth moment of Z, equal to zero; the same moment for B is 0.50. Both 
histograms have values of S equal to zero, but A is symmetrical and B is 


obviously not symmetrical. 


Other Common Transformations 


The numbers in a set may be transformed by the addition of a constant 
to each number, the subtraction of a constant from each number, the 
multiplication of each number by a constant, or the division of each number 
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by a constant. The result of one of these operations, or a combination of 
them, is a new set of numbers. It will be instructive to examine the four 
descriptive moments of the new set in relation to those of the old set. When 
numbers are transformed in this way they are often said to be coded. These 
transformations are widely used and are of considerable importance. 

First, we shall describe the consequences of adding a constant to each 
number in a set. 

Let X be any number in the original set of N numbers and let a be a 
constant. 

Let X' be a value computed from X by the formula: 


X'—X-a. 
Then X' is any number in a new set of N numbers. 
Let My be the mean of the new set. Then 
ZX 
My = Wo 


We can substitute (X + a) for X’ and obtain 


A(X +a) _ 2X + Na 
N è N 


Conclusion: Coding by adding a constant adds the constant to the mean. 
By similar methods we could show that coding by subtracting a constant 
subtracts the constant from the mean. 

Let V. be the variance of the new set. Then 

Ex^ S(X' — Mz)? 


Ve = Ny N 


We can substitute (X + a) for X’ and (M. + a) for My to obtain 


2Z[(X + a) — (M. + a)r 
EU N , 
which can be rewritten as 


My = = M, +a. 


since the constants in the numerator cancel. 

Conclusion: Coding by adding (or subtracting) a constant does not change 
the variance. Nor would the standard deviation be affected. 

Let S’ be the index of skewness of the new set. Then 


Iz sa _ Ex" 
N N NS3 


Examine the numerator, 2x". By definition, 


s’ 


- ——— — 
—— RE ———————"JS J 
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x = (X’ — M,). 
We can substitute (X + a) for X’ and (M, + a) for Mx, to obtain 
x =(X +a) —(M, +4) =X —M,=x. 
We have already found that Sy = S» that is, coding by adding or sub- 


tracting a constant does not affect the standard deviation. Therefore, if we 
substitute x for x’ and S, for Sx, we find that 


C. 
— x 


S' 


Conclusion: Coding by adding (or subtracting) a constant does not affect 
the index of skewness. 
Let K' be the index of kurtosis for the new set. Then 


zzi Axes) 2x" 


dimir : N “NE 
but we have determined previously that x’ = x and S, = Sz Therefore, 
; 2 
K = NSi zm K. 


Conclusion: Coding by adding (or subtracting) a constant does not affect 
the index of kurtosis. 

Next we shall consider the consequences of multiplying (or dividing) each 
number in a set by a constant. 

Let X' be a value computed from X by the formula: 


Y= an 


Consider the mean of X’. By definition, 


2X. 
IE 
We can substitute aX for X' and obtain 
Zax | aXX 
My = UNS = UN = aM;. 


Conclusion: Coding by multiplying by a constant multiplies the mean by the 
constant. Since division can be expressed as equivalent multiplication, we 
also conclude that dividing by a constant divides the mean by the constant. 

The variance of X’ is, by definition, 

Ex" — XX'— Moy. 
Ve N N 


88 Moments 


Substitution of aX for X’ and aM, for M. yields 
S(aX —aM,)  Zd(X — M,  a2X(X-— M; 
BEEN d N N 
Conclusion: Multiplying by a constant multiplies the variance by the square 
of the constant. Dividing by a constant divides the variance by the square 
of the constant. 
Note that if 


Vy cm i i 


E = 20; 
then 


(Vex = Sy = JV, = aV, = aS; 


Conclusion: Multiplying by a constant multiplies the standard deviation by 
a constant. Dividing by a constant divides the standard deviation by a 
constant. 

Consider next the index of skewness for the new set of numbers. We have 


already observed that 
3 


Ax 
S = pes’ 
NS? 
Let us examine x’ in the numerator, Zx”. By definition, 
x = X'— My 


and, after substitution, 
x’ = aX — aM, = a(X — M,) = ax. 


Then 

Zax ^ gx 
NS& ^ NSS 
Recall that Sy = aS,. After substituting, we find that 


, eax A 


s= 


~~ Na3S3 = NS? 
Conclusion: Since S' = S, coding by multiplying (or dividing) does not 
affect the index of skewness. 
Finally, let us consider K', the index of kurtosis for the new set of numbers. 
We have already established that 


B Ax 
|. NV 
that x’ = ax, and that V, = a?V,. We can substitute ax and a?V, in the 
formula for K' and obtain 

Za'x! asx Axt 

Nat V2? — NaM — NV 

Conclusion: Since K' — K, coding by multiplying (or dividing) by a constant 

does not change the index of kurtosis. 


K' 


> 


D 
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The effects of coding on the four descriptive moments and the standard 
deviation are summarized in Table 4.3. 


Table 4.3 


EFFECTS OF CODING OPERATIONS 
ON THE DESCRIPTIVE MOMENTS 


Operation 
X+a aX X/a 


Moment S, 


The coding operations described above are often employed in computing. 
Traditional paper-and-pencil methods of computing means, variances, and 
standard deviations for numbers classified into score or class intervals in a 
frequency distribution have involved the following steps: 

1. One chooses, as a new arbitrary origin, the midpoint of an interval, 
usually but not necessarily near the center of the distribution. (This new, 
arbitrary origin is often called an “ assumed mean " ora " guessed mean.") 

2. A “ zero ” is assigned to the interval containing the new origin. Then, 
working in both positive and negative directions from this interval, one 
assigns small whole numbers, with appropriate signs, to all other intervals. 

3. The resulting, new scale values are employed in the computation of a 
mean, a variance, and a standard deviation. 

Implicit in this procedure are the coding operations, subtracting a constant 
and dividing by a constant. The constant which is subtracted is the midpoint 
of the interval containing the assumed mean. The constant divisor is the size 
of the class interval. Moments computed from the coded values must, of 
course, be uncoded by the reverse operations. The variance must be multiplied 
by the square of the constant divisor; the mean must be multiplied by the 
constant divisor and that product must then be added to the constant which 
was subtracted. 

Of more interest to us here than these paper-and-pencil methods are 
computational procedures for an automatic, desk calculator. It is often 
convenient, when one is working with large numbers, to subtract a constant 
from each as the numbers are entered in the calculator. For example, if one 
had a set of numbers ranging in value from 105 to 181, it would be a simple 
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matter to drop the hundreds digit and compute with numbers ranging from 
5 to 81. It is also convenient sometimes to ignore decimal points. Suppose 
the original numbers had values from 0.23 to 1.91. One could drop the 
decimal point, an action which is equivalent to multiplying each number by 
100, and compute with numbers running from 23 to 191. Whenever numbers 
are coded to facilitate computation, one must, of course, take account of the 
effects of coding and, when it is necessary, uncode computed values to obtain 
final answers. 


Special Transformations 


We shall mention and describe only briefly five transformations which are 
of some considerable importance in later work in statistics. We shall not 
attempt to show their specific effects on all of the descriptive moments. 

1. Squaring. Let X represent any number in an original set of numbers. 
Let Y represent any number in a new set and, further, let Y be defined by the 
formula: 

ye 
Example. Consider the original set of numbers: 
1, 2, 3, 4, and 5. 
Squaring each produces a new set consisting of the numbers: 
1, 4, 9, 16, and 25. 
Notice that the original set is symmetrical (S — 0) and that the new set is 


not symmetrical (S — 0). 
2. Extracting the square root. Let Y be defined by the transformation 


formula: 
Uem a. 


Example. Let the original set consist of the numbers: 
4, 9, 16, 25, and 36. 
Applying a square-root transformation produces the numbers: 
2, 3, 4, 5, and 6. 

The distribution of the original set is not symmetrical; that is, S > 0. The 
new distribution is symmetrical with S = 0. 

3. Transforming to logarithms. Let Y be defined by the formula: 

Y = logyX. 
Example. Given the set of numbers: 
0.032, 0.320, and 3.200, 


Special Transformations 91 
we can rewrite these three values as 
0.320 x 10-1, 0.320 x 10°, and 0.320 x 10!. 
We might find it convenient to code these numbers by dividing each by 0.320. 
After coding, the numbers would be 
10-1, 10°, and 10!. 
The logarithmic transformation, applied to the coded values, would then 


yield 
—1, 0, and +1. 


We would undoubtedly find it more convenient to work with the transformed 
values than with the original values. Notice that the distribution of the 
transformed values is symmetrical whereas the original distribution was 


positively skew. 
4. The inverse sine transformation. Let p be any number in a set such that 


Ospsl. 
A transformation of p to 6, an angular measurement, is given by the formula: 
6 = sin Jp. 


The formula can be read: “ Theta is an angle whose sine is equal to the 
square root of p." The transformation is effected by taking each value of p, 
obtaining its square root, and referring the root to a table of sines for the 


corresponding angle. 
Example. Let the original values of p consist of the numbers: 


0.03, 0.12, 0.25, 0.41, and 0.59. 


The angles whose sines are the square roots of these original values are, 


approximately, 
10°, 20°, 30°, 40°, and 50°. 


The original values are positively skew. The transformed values are 


symmetrical. 
5. The reciprocal transformation. Let X be any number in an original set 
of numbers. Then the reciprocal transformation to Y is given by the formula: 


Y= Ux. 
Example. It is evident that the asymmetry of the distribution of the 


numbers: 
1, 1/2, 1/3, 1/4, and 1/5, 


is eliminated when the values are transformed to their reciprocals, which are 


1, 2, 3, 4, and 5. 


92 Moments 
Substitution of Ranks 


It is possible to substitute ranks for the original values in a set of numbers. 
We shall consider this substitution only for those sets of numbers for which 
the resulting ranks contain no ties. When there are no ties, the mean and the 
variance of the ranks can be expressed as a function of N. If N numbers are 
ranked without ties, the sum of the N ranks is the sum of the first N positive 
integers. That is, the sum of ranks is given by the formula, 


N(N 4-1 
ZR, = NN +1), 
2 
where R, is the rank assigned to X. Then the mean of ranks is 
N+1 
= 


The sum of squares of the N ranks is given by the formula: 
AN + DON + D, 


ZR? 6 
The variance is 
SR — (ZR 
p,-— y~ 
which, after substitution and appropriate changes, becomes 
e c 
Kus N 1 
12 


Example. Given the numbers: 
2, 5, 6, 10, and 11, 


the corresponding ranks are 
5,4, 3,.2, and Í. 
In this example, N — 5 and the sum of ranks is 


I IH. 
— —2À 


The mean is 15/5 — 3. 
The sum of squares of the ranks is 


5 +12 x 541) _ 


6 55 


and the variance is 
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Summary 


The discussion in the present chapter has consisted of a syntactical 
treatment of a class of values known as moments. The syntactical treatment 
dealt with the definitions of moments and the logical properties of the four 
descriptive moments—not including any properties which have to do with 
error. We shall leave syntactics for a time, but shall return to it in Chapter 7 
where another class of values called product moments will be introduced. 
Having laid a groundwork of symbolism in Chapters 2, 3, and 4, we have 
chosen to take up, in Chapter 5, one of the major semantical problems— 
measurement. After we have discussed the nature of confirmation in psycho- 
logical measurement, we will be ready to consider, in Chapter 6, the first 
topic in pragmatics: the use of moments in describing data. 


Table 4.4 
FIVE SETS OF JOINTLY CLASSIFIED NUMBERS 


Columns 

V Ww X y z 

1 11 42 3 12 101 
2 86 122 2 10 146 
3 39 77 3 9 129 
4 91 88 2 8 151 
5 16 62 4 13 136 
6 73 115 5 14 223 
7 30 45 4 12 150 
8 69 61 2 7 129 
9 27 57 2 10 87 
10 72 103 2 6 132 
11 37 57 4 15 157 
12 35 39 4 10 155 
Rows 13 65 80 2 8 125 
14 71 74 2 6 131 
15 59 65 2 8 119 
16 82 103 3 8 172 
17 99 130 3 10 189 
18 11 42 3 11 101 
19 7 30 8 7 97 
20 24 62 4 14 144 
21 32 56 4 13 152 
22 88 107 4 13 208 
23 24 77 1 3 54 
24 19 29 4 10 139 
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Exercise 4.1 


Compute M, and V,, M,, and Vw, M, and V,, M, and V;; in other words, 
compute the first and second descriptive moments for the variables: V, W, Y, and 
Z in Table 4.4. For variable X, compute all four descriptive moments: My, Vz, 
S, and K. 

Code variable X by adding *' one " and multiplying by '* two." Then, for the 
coded values, compute the four descriptive moments: Mx, Vx, S’, and K’. Compare 
the moments for X’ with those for X. Could you have predicted what they would be? 

Example of the use of computing formulas for the mean and variance of variable V. 

Formula (4.1) for the mean is 


ZV 
M, = N` 
ZV = 1167 and N = 24. Then 
M, = Ler a 48.625 
a 2M 
Formula (4.2) for the sum of squares of deviations is 
ro (2h 
ZV? — et 
ZV = 1167; EV? = 76,795; N = 24. Then the sum of squares of deviations is 
76,795 — EU — 20,049.625 
and the variance is 
20,049.625 
V, = ep em 835.40. 


Exercise 4.2 


For each of the eight frequency distributions given in Table 4.5, compute the 
four descriptive moments: M,, V,, S, and K. You will find that the numbers are 
simple to work with and that an automatic desk calculator is not necessary. 


Table 4.5 
EIGHT FREQUENCY DISTRIBUTIONS 

(1) (2) (3) (4) 
x f x "d x f x £ 
1 1 1 1 1 0 1 [U 
2 0 2 2 pA 10 2 11 
3 6 3 6 3 0 3 3 
4 0 4 2 4 10 4 5 
5 1 5 1 5 0 5 1 

(5) (6) (7) (8) 
X f X l4 X f X i 
1 0 1 2 1 1 1 0 
2 2 2 3 2 14 2 3 
3 3 3 15 3 24 E 3 
4 0 4 1 4 2 4 1 
5 1 5 3 5 T 5 1 
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Obtain graph paper and draw a histogram for each frequency distribution. Enter 
beside each histogram the values you obtain for the four descriptive moments. 
Study the differences among the histograms and the corresponding differences 


among the moments. 
Example of computation for distribution (1) of Table 4.5. 


Zf—-N-—S. 
XfX = 1(1) + 0Q) + 6) + 0(4) + 15) = 24 
The frequency distribution of x is 
x f 
m 1 
-—L 0 
0 6 
+1 0 
+2 1 
Sfx? = 1(—2)? + 0((—D? + 6(0)? + O(1)? + 12} = 8 
Sf 8 
Kee men! 


S,=VVz=V1 =1 
Since Z, = x/S,, the frequency distribution of Z, is 


Ze f 
2] 1 
eb 0 
0 6 
+1 0 
42 1 
zfz = 1(—2)* + ((—1» + 60° + 00) + 1} = 0 
xi 0 
m aimé mud 
zfZi- 1(—2)* + ((—D' + 60* + O(1)* + 1(2)* = 32 
2X _ 32 _ 4 


=N 8 


MEASUREMENT 


Semantics | 5 


The totality of semantical issues and problems confronting the research 
psychologist is, without doubt, awesome. It would be ill-advised, to say the 
least, to attempt the resolution of all of these issues in the two chapters which 
are devoted to them in this text. Our goal is a modest one. In the present 
chapter there will be undertaken a preliminary and limited exploration of two 
specific questions about measurement. These questions are of concern to 
research psychologists and have special relevance to the study of statistical 
methods in psychological research. In Chapter 13, the problem of confirma- 
tion as it arises in statistical inference will be taken up. 

The two questions to be considered in the present chapter are the following: 
(1) What is the nature of confirmation and disconfirmation with respect to 
the semantical designations of psychological measurement? (2) How do ideas, 
plans, and specifications for the preliminary tasks, operations, and materials 
originate in the initial period of test or scale development before confirmation 
is sought? 

There is considerable agreement among psychologists regarding certain 
notions about measurement. Numbers have properties. Behavioral events and 
products have properties, too. When there is some kind or degree of corres- 
pondence between these two sets of properties, it is legitimate and worthwhile 
to assign the numbers to the events and products. When there is no corres- 
pondence between the two sets of properties, the assignment of numbers is, 
inevitably, a fruitless and wasted effort. (somorphism is a term that is often 
employed to refer to the one-to-one correspondence of numbers and objects 
or events.) Confirmation or disconfirmation involves demonstrating in some 
public and objective fashion that correspondence does or does not obtain in 
some given instance of attempted measurement. 

When one goes beyond these basic notions about the nature of psycho- 
logical measurement and its evaluation, one finds much less agreement. It is 
not possible and, even if it were, it would not be appropriate, to present here 
a complete historical and contemporary account of the various ideas psycho- 
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logists and others have had about measurement and the requirements of 
measurement. It does seem reasonable and appropriate to describe here two 
approaches to the problem, both of which figure, in varying forms and to 
varying degrees, in current discussions of measurement.! 


The Structural Approach 


If one takes for granted the desirability of correspondence between the 
properties of numbers and the properties of behavior, it is tempting to begin 
the quest for confirmation with a logical analysis of the properties of numbers. 
Having produced a list of the logically determined properties of numbers, one 
might then require a point-by-point demonstration of correspondence be- 
tween these properties and the observable properties of the behavior. 
“ Demonstration” and “ observable" are key words here; the former 
implies that some kinds of operations, in which the objects or events being 
measured have a central role, must be performed by the investigator; the 
latter has the meaning one would expect an empirical scientist to give to it. 
For the empirical scientist, in general, measurement is quantified observation; 
it is his experience of the objects and events of the world, translated into the 
language of numbers. For the psychologist, measurement is the observation 
of behavior, given expression in the language of numbers. 

If a point-by-point determination of the correspondence of the two sets of 
properties is necessary, how does one go about staging the demonstration ? 
Let us consider an example involving a familiar kind of physical measure- 
ment, the measurement of length. Everyone is acquainted with the fact that 
numbers can be incorporated in statements like those listed below. 


L X=Y 

2, XY 

4. Y= X= A — ds 

4, XiXy 2 ats 
Let these four statements constitute our representation of at least some of 
the properties of numbers. So much for the numbers. 

For the other half of the example, we shall use ropes of varying lengths, 
chosen to suit our purposes. Now two ropes, A and B, have the property, 
A — B, when they are of the same length. That they are of the same length 
can be determined, without reference to or use of numbers, by placing them 
1No attempt will be made here to give an authoritative account of the views of particular 

re so complex, it is probable that the views of any 


individuals. In an area where the issues à 4 
ly only by that person. What follows is an attempt 


particular person could be given accurate an attem| 
to formulate two representative positions. The student may, of course, readily identify 


certain elements of the present discussion as having derived from the work of persons 
other than the author. 
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side by side, stretching them reasonably taut, and observing that the ends 
of one coincide approximately with the ends of the other. In the same kind of 
situation, they have the property, A > B, when A is longer than B, which, of 
course, is the case when one observes that the ropes coincide at one end but 
not at the other. Given four ropes—A, B, C, and D, they have the property, 
A — B = C — D, when the amount by which A extends beyond B can be 
shown to be equal to the amount by which C extends beyond D. Understand 
that the set of relations can be demonstrated by arranging and observing the 
ropes. The demonstration would not require the use of numbers in any way. 

To demonstrate the relation, 4B — CD, requires a somewhat different set 
of operations. We would need a flat surface on which to mark off a rectangular 
area using rope A for the one dimension and rope B for the other. We would 
also mark off a rectangular area using ropes C and D. By subdividing each 
area into some arbitrary unit area we could establish that the two rectangles 
enclosed the same amount of space. 

The structural approach to confirmation does not require that, in every 
case, the demonstration of relations among properties of objects or events 
must necessarily be reduced to the specific operations with ropes which were 
described above. It is conceivable, although not necessarily true, that there 
are many other sets of operations which bear little or no resemblance to the 
physical manipulation of ropes, but which would be appropriate for the 
demonstrations. 

What does seem to be clearly required by the structural approach is that 
the operations employed in demonstrating the properties of objects, as 
distinguished from the properties of numbers, are of a primitive type which 
does not involve the use of numbers and which does involve observation 
and the overt management of the observer. These primitive operations have 
the status of a criterion in that they constitute a procedure for confirming oF 
disconfirming the designation. 

Our reason for emphasizing the primitive character of the confirming 
operations and the important role of observation and management is that 
these are the features which guarantee the presence of the designatum in the 
evaluation of the sign-designatum relation. If one wished to evaluate a given 
method of psychological measurement and if there was not an obviously 
appropriate set of primitive operations available, one could make the mistake 
of confining his attention to the numbers themselves and forgetting the 
behavioral events. The numbers, of course, would prove to have the pro- 
perties that numbers have. The mistake consists of leaving out of considera- 
tion the system of behavioral properties. One would have to be very clear 
and certain that both sets of properties had actually figured in the 
demonstration. 

There are other pitfalls one must avoid in taking the structural approach. 
It is a mistake to try to find a substitute for the primitive operations, when 
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they elude one's search. An improper substitution, to which psychologists 
have been known to resort, has its origin in a misinterpretation of the role of 
observation in measurement and in science. Many psychologists are familiar 
with and agree with some form or variation of the statement: Measurement 
always involves a human observer. It is easy to rearrange this sentence into 
one which may seem to be logically defensible, but certainly is not: The action 
of a human observer always results in measurement. There is an important 
area of psychological measurement which has to do with the judging of 
physical and behavioral stimuli; it is called psychophysics. In this area, it is 
understandable that, in the absence of an obviously relevant set of primitive 
operations, the judgments themselves might be mistakenly interpreted or 
viewed as constituting these operations. This amounts to rewriting the 
sentence about measurement and the human observer, given above, into the 
sentence: The judging of any attribute in a quantitative fashion always con- 
stitutes measurement. The concrete, manipulative operations which should 
be applied to the objects being measured have disappeared from the 
evaluation scene and the focus of attention is now on the act of observation. 

To maintain that observing the relation between two ropes placed side by 
side is fundamentally the same as judging the pitch of two different tones 
presented successively, or judging the sourness of acid solutions of varying 
concentrations, is to deemphasize the role of the primitive operations in 
determining correspondence and to shift the burden of confirmation else- 
where. Perhaps that is just what should be done, but, if it is, we should 
recognize and acknowledge the fact. It may be that the important factor in 
confirmation is not the set of primitive operations, but is, rather, the quality 
of the observation itself. We shall not say what we mean by " quality "st 
this point or how it might be evaluated, since much of what will follow 
concerning the second approach will deal with these issues. hex: 

There is another kind of inappropriate substitute for the primitive 
operations—a logical analysis of the behavior in question. To try to solve the 
problem of confirmation by a logical analysis of numbers and a logical 
analysis of the behavior is to try to convert the problem from an empirical 
one to a theoretical one, Or from a semantical problem to a syntactical 
problem. One might mistakenly resort to axioms, to @ gt oer A to 
theory—perhaps theory of a mathematical sort—in an e ort to endow 
measures with the desired attributes by simply imputing these attributes to 
them. The syntax of the language in which the logical ques =| apm 
and the logical analysis of behavior are reported might " perfect, » 3 
would be devoid of empirical ipsas The semantical question abou 
confirmation would remain unanswered. r 

Before leaving the structural approach, we shall qe ta Eos pos 
disturbing feature it possesses. It provides no ds weed ie, 
sequences of failures in correspondence. The structa senes ps ose one 
a logical analysis of the properties of numbers. There is hardly any 
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can place on the number of abstract properties which might result from an 
exhaustive analysis. The analysis might cause our attention to be focused on 
properties which were irrelevant or superfluous to confirmation. What we 
need is some way of assessing the consequences of a failure in correspondence 
with respect to any given property, thereby assuring ourselves of its relevance. 
We could then establish a class of properties for which failure in corres- 
pondence had serious consequences and a class of properties for which 
failure had negligible consequences. 

To date, there is very little evidence of success in using primitive operations 
to confirm or disconfirm the designations of psychological measurement. To 
say there has been little success does not mean that there can be none, or 
that there will be none. The possibility of breaking through this barrier in 
the future cannot be denied. In the meantime, however, psychologists who 
wish to take the structural approach to confirmation find themselves in a 
difficult and, perhaps, awkward situation. What happens if one wishes to seek 
confirmation for a given scale and cannot discover the primitive operations 
required in his case? Does he abandon his scale? Not likely, for to do so 
would mean, at this stage in the development of psychology, abandoning 
most, if not all, of psychological measurement. 


The Functional Approach 


There is a sense in which one might say the structural approach is a 
molecular approach. That is, it requires an enumeration of individual pro- 
perties and the checking of correspondence property by property. The 
structural approach requires a logical decomposition of the general features 
of the number system into quite specific attributes. There is neither a priori 
reason nor evidence which compels us to believe that confirmation or dis- 
confirmation can only be achieved by means of a molecular approach. It may 
not be at all necessary to demonstrate point-by-point correspondence for a 
list of abstracted number properties. It is entirely possible that an approach, 
in which attention is directed to the general attributes of sets of numbers— 
a molar approach—can provide the evidence that is necessary and sufficient 
for confirmation or disconfirmation. 

If the structural approach is a molecular one, then the functional approach 
is molar. Essentially, it requires examination of the correspondence or rela- 
tion between two sets of paired numbers in each one of three kinds of situa- 
tions. The three situations provide the basis for evaluating three important 
characteristics of a measurement variable: (1) its distribution, (2) its 
reproducibility, and (3) its predictiveness. 

By distribution we mean that, under some, if not all, circumstances, 


The Functional Approach 101 


measures of different objects or events should vary. That is to say, a given 
psychological test or scale, if it is to satisfy the distribution criterion, should, 
when it is administered to some group of subjects, produce varied scores. If 
that test or scale fails to produce variable scores, as it is applied to various 
groups of subjects, there is no evidence of confirmation with respect to 
distribution. There is an assumption here that on any single occasion people 
respond differently and that the test or scale should reflect these response 
differences. 

A common way of exhibiting the variation in response, when it is detected, 
is to arrange the scores or measures in a frequency distribution which gives 
a pairing of each different measure of response and its frequency of occurrence 
in the group of subjects. Thus the characteristics of the relation between 
frequency and the magnitude of some measurement variable would be taken 
account of in evaluating the measurement with respect to the first criterion— 


distribution. 
By reproducibility we mean that the results of repeated measurement of the 


same object or event should not vary much. In psychological measurement, 
this second requirement calls for two sets of measures on the same variable 
and the same subjects. The two sets of measures are paired, of course, as a 
consequence of measuring each subject twice. Assuming that the distribution 
requirement is met for both sets of measures, then the reproducibility require- 
ment is met if there is a linear relation between the first set of measures and 
the second, and if the slope of the function is approximately unity. Saying 
there should be a linear relation does not imply that it should be perfect or 
that the plotted points must lie in a perfectly straight line. Some degree of 
irregularity can be tolerated. Just how this linear relation and the satis- 
factoriness of the regularity might be expressed will not be described here. 
It will be discussed in Chapter 9, which covers the descriptive uses of product 
moments. Notice that implicit in the evaluation is the ordering of subjects 


Table 5.1 


TWO SETS OF MEASURES ON THE SAME SUBJECTS; 
AN ILLUSTRATION OF REPRODUCIBILITY 


First Set Second Set 
1 17 18 
2 14 14 
3 25 26 
4 23 24 
à 5 11 12 
Subjects 6 13 12 
7 24 24 
8 21 21 
9 15 16 
10 20 19 
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on each occasion and an examination of the equality or inequality of corres- 
ponding differences on the two sets of measures. If the relation is to be linear 
and the slope is to be unity, then the difference between two subjects on the 
first occasion must be equal to the difference between the same subjects on the 
second occasion. 

As an illustration of a situation in which the reproducibility requirement 
is satisfied, two sets of measures are given in Table 5.1 and the plot of the 
two sets is given in Figure 5.1. A straight line has been drawn through the 
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Fig. 5.1 Plot of the two sets of measures in Table 5.1. Notice the distinct linear trend 
of the points. The plot indicates that the reproducibility of the measures is quite good. 


configuration of points to show how close to linearity the relation is. The 
position of the line was determined by inspection. Its slope is approximately 
unity. 

Table 5.2 


TWO SETS OF MEASURES ILLUSTRATING 
A LACK OF REPRODUCIBILITY 


First Set Second Set 
1 15 19 
2 11 22 
3 13 14 
4 23 13 
5 21 2 
Subjects 6 17 " 
7 19 20 
8 10 17 
9 14 11 
10 18 10 
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. As an illustration of a situation in which the reproducibility requirement 
is not satisfied, two sets of measures are given in Table 5.2. Their plot appears 
in Figure 5.2. No line has been drawn through the points because the 
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Fig.5.2 Plot of the two sets of measures in Table 5.2. There is no pronounced trend in 
the configuration of points. The plot indicates that the measures lack reproducibility. 


Table 5.3 


MEASURES ON TWO VARIABLES; 
AN EXAMPLE OF A PREDICTIVE RELATION 


Variables 
X F 
1 22 47 
2 16 35 
3 11 6 
4 27 38 
5 21 46 
6 30 25 
7 24 45 
8 29 30 
9 12 13 
10 18 41 
Subjects 11 26 41 
12 28 35 
13 13 20 
14 14 25 
15 23 46 
16 25 44 
17 20 45 
18 19 44 
19 15 30 


20 17 


104 Measurement 


irregularity and inconsistency of corresponding differences are extreme and 
there is no suggestion of a linear relation. 

The third requirement, in the functional approach to confirmation, is 
predictiveness. By predictiveness we mean that, given measures of the same 
objects or events on two different variables, the two sets of paired observations 
should be so related that, from a measure on one variable, one can predict 
the corresponding measure on the other variable. It is not required that the 
prediction be perfect and, consequently, one needs a standard for deciding 
when the requirement is met. The procedures commonly used for deciding 
the issue will be discussed in Chapter 9, which deals with the descriptive 
uses of product moments. 

In Table 5.3 are presented measures on twenty subjects on two different 
variables, ¥ and Y. The twenty pairs of measures are plotted in Figure 5.3. 
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Fig. 5.3 Plot of the paired measures from Table 5.3. Notice the highly predictive relation 
between X and Y. 


Notice that the pattern of points is distinctly curved indicating that the pre- 
diction from X to Y would be fairly accurate if one knew the equation for the 
curve which fits the configuration. The pattern could, of course, take many 
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Fig. 5.4 Plot of paired measures for variable X and Y. There is no indication of a 
predictive relation between the two variables. 
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other forms, including that of a simple, linear trend. A highly irregular 
pattern, such as the one displayed in Figure 5.4, would be indicative of a 
failure in predictiveness for variable X with respect to variable Y. 

It has been our experience that many students of psychology possess a 
strong intuitive appreciation of certain scale properties which have to do with 
the equality or inequality of intervals, but that relatively few students perceive 
the relevance of a functional approach in investigating these properties. In 
some instances the problem of “ equal intervals " is introduced into their 
discussions by vague references to the obvious equality of the standard units 
employed in the measurement of physical length and is then dismissed with 
an uneasy reference to the intervals on some psychological scale as being 
“ probably not equal.” This uncertain view of measurement is often accom- 
panied by misgivings about the consequences of using nonlinear transforma- 
tions of observations, such as square roots, logarithms, and reciprocals, 
because of the effect of these transformations on the scale intervals. An 
inadequacy is often revealed in the student's thinking by his apparent belief 
that the substitution of ranks can introduce a kind of regularity into his data 
which was not to be found in the original measures. 

Implicit in this uncertain and sceptical view of psychological measurement 
are many issues which could, perhaps, be made explicit, but only with the 
investment of a great deal of time and effort in the analysis. It might be un- 
profitable, in the end, to make such an investment in the analysis of vague 
ideas and concerns. It could be more worthwhile to spend our time here in 
developing an adequate approach to measurement than in trying to cope with 
an inadequate one. We propose that the only appropriate questions one can 
ask about the intervals of a given scale are the following: Is there any 
regularity to be discerned in the relation between observed differences on 
that scale and observed differences on some other scale? Can this regularity, 
if it does exist, be expressed by means of a mathematical function or 
equation? 

The nature of the intervals on a given scale may be difficult or impossible 
to determine in any absolute sense, but the regularities (or irregularities) of 
intervals on that scale in relation to the intervals of another scale can be 
determined and expressed. The procedure is not difficult. Furthermore, it is 
never impossible. That is to say, one can always examine measures on one 
scale as a function of measures on another. The procedure is a familiar and 
standard practice in many fields. It is readily available and widely applicable. 
To our knowledge, it is the only procedure that can be adopted as a standard 
practice and used routinely by the research psychologist. It may even be true 
that there is no meaningful way of discussing the properties of intervals on 
a scale, except as those intervals are related to intervals on another scale. 
The student’s tendency to put the problem in terms of physical magnitudes, 
such as length, and in terms of concrete objects, such as meter bars, may even 


106 Measurement 


be considered a resort to a primitive type of expression of functional relations. 

There is one final point to be considered—the relation between the 
structural approach and the functional approach. As appealing as the struc- 
tural approach may be to a theoretically oriented research psychologist, he 
cannot afford to disregard the requirements of the functional approach. That 
is, even if he could achieve confirmation by demonstrating point-by-point 
correspondence, which as we have indicated is difficult, uncertain, and per- 
haps impossible, he could not take distribution, reproducibility, and pre- 
dictiveness for granted. He would surely be very much interested in studying 
these characteristics of his scale and in achieving them. On the other hand, 
the practical-minded research psychologist may turn his attention directly to 
the requirements of the functional approach, assuming that if he can meet 
those requirements, then the requirements of the structural approach are not 
binding on him. 


Summary 


The important semantical question as to the nature of confirmation is a 
difficult one. Two representative positions with respect to the question's 
answer have been described briefly and in somewhat general and elementary 
terms. It is probably apparent to the student that the present author takes a 
position supporting the functional approach. There is little doubt that discus- 
sion of these and other positions will continue for a long time before anything 
resembling complete agreement or unanimity of thought is realized by 
research psychologists. 
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Before one can seek confirmation, one must have devised a provisional 
method of measurement. There must be a task or a set of tasks. Standard 
procedures, which almost always include a set of instructions for the subject, 
the investigator, or both, must be adopted. Materials or instrumentation may 
have to be devised. Arrangements have to be made for recording observa- 
tions. Out of this integrated system of tasks, procedures, and materials will 
come a record of the subject's behavior to which numbers either already are 
or soon will be assigned. The quantified record represents a provisional set 
of measures, whose quality is unknown. An attempt at confirmation of the 
designations is usually the next step. But we have discussed confirmation; let 
as turn our attention now to the origins of the scale. 

From what source do the ideas, plans, and specifications for the provisional 
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system come? It will be worthwhile to intensify our awareness and sharpen 
our perception of the source. Perhaps we should say “ sources,” for there are 
at least three principal ones. First, the psychologist imitates the work of 
others and often the imitation is built upon other imitations and is far 
removed in time from the work of the originator. Second, he acts upon 
hunches and intuitions, which arise not only out of his professional and 
research experience but also out of his everyday experience of living. Third, 
he develops theories and from a particular theory he may deduce hypotheses 
about the quantification of behavior, which lend themselves to translation 
into provisional methods of measurement. Whatever the source—imitation, 
intuition, or theory—he obtains from it no more than a provisional scale. 
The attempt at confirmation must follow. 

Most of the scales employed currently in psychological research are 
basically imitations of other scales. As we shall see later, it is appropriate 
that this should be so. Although the multitude of scales now being employed 
could be classified according to a variety of schemes, for purposes of our 
brief discussion, which is intended to provide the student with an overview 
of current practices, a simple classification scheme will be employed. The 
classification scheme involves five categories, each category being identified 
by a manifest and salient feature of the system which results in the assigning 
of numbers. 

The first category contains physical measures of response. The distance an 
athlete can put the shot, the height he can jump, his time in running a distance 
of a mile, and many other assignments of numbers similar to these qualify 
as provisional measures of response. The length of a human figure drawn by 
a child, the time which elapses between the presentation of an interview 
question and the response of the interviewee, the quantity of water drunk by 
a dog, the pressure a subject exerts on à control lever in a stressful situation, 
and the change in conductivity of the skin surface in the palm of a subject's 
hand are also potential physical measures of behavior. Many others could 
be listed in this category. Some of them, of course, are of less current interest 
to psychologists than are others. 

The second category consists of methods of measurement which require the 
Counting of responses. To count in any meaningful fashion requires, of course, 
that the response to be counted must be identifiable. Frequency counts of 
Tesponses constitute a major type of measurement in psychology and some 
of the best psychological measurement occurs in this area, The number of 
revolutions of an activity wheel produced by a rat in some standard period 
of time, the number of food pellets consumed by a rat, the number of bar 
Pressings performed by a rat, the number of arithmetic problems a child 
solves correctly on a test, the number of goals a basketball player makes in 
à game, the number of pins a subject transfers and places correctly in a 
dexterity test, the number of questions answered * Yes" on a personality 
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inventory, the number of parts assembled by a factory worker in a day, the 
number of accidents in which a truck driver is involved, the number of times 
a housewife expresses a preference for one brand of soap in comparing it 
with nine other brands, the number of activities a high-school student checks 
as his interests, the number of self-references in a client's protocol from a 
therapy session, the number of occasions on which a student signs out a 
particular reference book in the library, and the number of nonsense syllables 
recalled from a list which had been learned previously to some standard— 
these and countless others make up the class of psychological measures which 
require that responses be identified and counted. 

In the third category are the measurements of classical psychophysics. The 
psychologist records a physical measure of a stimulus under standard con- 
ditions for counting the responses of a subject who must respond to the 
stimulus with a discrimination of some sort.? The discriminatory response of 
the subject may be detecting the presence of a stimulus or detecting a 
difference between two stimuli. It has been known for a long time that 
a stimulus or a difference between stimuli of any given magnitude within a 
broad range of values on a physical dimension, will sometimes be detected 
and sometimes not. Thus in a large number of judgments of a given stimulus 
magnitude or a given difference, the proportion of correct judgments can be 
determined. Since any stimulus within a considerable range will sometimes 
be detected and sometimes not, there is no value which obviously qualifies 
as the threshold, that is, as the lowest detectable value or the smallest detect- 
able difference. The general practice is to define a threshold as a physical 
measure of the stimulus which is correctly detected in some specified propor- 
tion of trials. The procedure in determining a threshold value is to specify 
the proportion of correct judgments which will be taken as defining the 
threshold. The psychologist must then determine by systematic testing pro- 
cedures, of which there are more than can be described adequately here, the 
value of the stimulus associated with the specified proportion. If the task is 
to detect the presence of a stimulus, the value so determined is called an 
absolute threshold. If the task is to detect a difference between stimuli, the 
value is called a differential threshold. 

The intensity of a tone of specified frequency, the difference in frequency 
of two tones of specified intensity, the separation in millimeters between two 
black lines in a test of visual acuity, the voltage employed in the projection 
of taboo words on a screen, the concentration of sugar in a solution which is 
tasted repeatedly by a subject, the concentration of monosodium glutamate 
in a solution for taste testing, the ratio of the major axis of an ellipse to the 
minor axis in a visual test, the difference in temperature between two water 
baths in which the subject immerses his hands, and the angle of tilt of a chair 


2The specific psychophysical methods implied by this statement are the *' constant 
methods." The method of adjustment and the method of limits require physical measure- 
ment and repeated judgments but the data are handled differently. 
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in which the subject is seated—these physical measures of stimuli are typical 
of a vast array of psychophysical measurement variables. 

In the fourth category are measures derived from the judgments of one or 
more persons but not based on physically measured attributes of the objects 
or events judged. In the method of absolute judgment, the stimulus object 
or event is presented to the judge who responds by giving a numerical rating. 
The rating may be recorded by the judge with the help of some simple graphic 
device or it may be recorded directly as a number. The ratings given by two 
or more judges in response to the same stimulus are often averaged. In the 
method of comparative judgment, two or more stimulus objects are presented 
to a judge who expresses a preference for one of the objects, orders the objects 
on some assumed dimension, or judges the relation(s) between the objects. 
The responses of a group of judges are then tabulated and given expression 
as placing the objects in positions on one or more quantitative scales. The 
judging operations vary from one research problem to another, and, for any 
given judging operation, there may be different procedures for converting 
the observed judgments into scale values. Some conversion systems are rela- 
tively simple, as is the case when the objects are ordered according to the 
proportion of preferences expressed for each. Other conversion systems are 
somewhat complex. An example of a complex system is Thurstone's law of 
comparative judgment. 

Listed below are examples of ratings which have been applied to behavioral 
events: job efficiency, ability to do independent study, potential capacity for 


research or other creative acivity, social acceptance, interest in military 
peed and accuracy of judg- 


assignment, resourcefulness, emotional control, s 
teamwork, 


ment, agility, daring, stamina, initiative, social awareness, 
organizing ability, conformity, reasonableness, leadership, integrity, altruism, 
reliability, sportsmanship, creativity, egotism, friendliness, courage, effort, 
conduct, citizenship, cooperation, adjustment, flexibility, originality, en- 
thusiasm, supervising ability, adaptability, executive influence, eagerness, 
foresight, leadership, stinginess, obstinacy, disorderliness, bashfulness, 
sociability, emotional stability, and dominance. 

Three examples of methods of obtaining comparative judgments are given 
below. 

1. A manufacturer of perfumes desires information concerning the 
preferences of female college students for four new scents produced by his 


firm. A representative of the manufacturer interviews each one of a large 


number of young college women and obtains her preferences. In each inter- 


view the subject is given the opportunity to react to each possible pair of 
Scents and to express a preference for one member of the pair. Since there are 
six different pairings possible for four objects, each subject indicates six 
preferences. The preferences of all subjects are pooled and used to derive 


values for the four scents on a quantitative scale. 
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2. In a study of verbal associations, a psychologist develops a list of ten 
words and forms all possible groups of three, numbering 120 in all. His 
subjects are instructed to examine each triad and to choose the two words 
closest together in meaning. Data for 100 subjects are combined, the relative 
frequency with which each word is associated with every other word is com- 
puted, and the pattern of relations among the words is studied. The result 
is that the ten words are assigned positions on each one of several hypothetical 
scales. 

3. In developing a criterion of performance for enlisted men, a personnel 
psychologist had supervising officers rank their men on technical competence. 
Each officer was presented with a list of twenty enlisted men whom he had 
supervised and was instructed to order them in terms of competence, using 
for ranks the first twenty integers and giving the most competent man a rank 
of one and the least competent man a rank of twenty. 

In the attempts to quantify absolute and comparative judgments, where no 
relevant physical measure of the stimulus is available, a great deal of research 
effort has been invested in devising scoring and weighting systems. An 
account of these varied systems would necessarily involve an extensive dis- 
cussion of particular scaling theories and would go far beyond the proper 
scope of the present treatment. 

The fifth and last category into which we have attempted to classify 
psychological measures is not as large as the other four categories, but does 
contain a distinctive set of measures. Two basic procedures can be distin- 
guished in this fifth category. An example will be given below illustrating 
each procedure. 

1. A clinical psychologist who specializes in marriage counseling wishes to 
assess the similarity of interest patterns of husband and wife for each one of 
a large number of married couples. He compiles a list of twenty of the most 
frequently reported leisure-time activities and asks each subject to rank these 
twenty activities in terms of their interest to him (or her). After the rankings 
are completed, the similarity of interest patterns for a couple can be assessed 
by evaluating the relation between the husband's ranks and his wife's ranks. 
When each of the twenty activities has the same rank for both husband and 
wife, there is maximum similarity. The ranks can, of course, be assigned with 
so little consistency that there is little or no similarity. The extent of the 
relation can be perceived if one plots the wife's ranks against the husband's. 
When the ranks are assigned in exactly the same way by both parties, the 
points of the plot will lie in a straight line. When there is considerable incon- 
sistency, the points will scatter and will not define a trend. Similarity of 
response can be measured and represented by a single index number, a 
coefficient of correlation computed from the two sets of ranks. (The 
computation of this index will be described later.) 

2. A counseling psychologist wishes to measure the congruence between 
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a student's description of himself with respect to social competence and his 
description of the person he would, ideally, like to be. The psychologist 
collects an appropriate set of thirty-two statements, each of which can be 
employed for self description in regard to social competence. The statements 
are typed separately on thirty-two cards. In an interview with each of his 
student clients, he instructs the client to sort the thirty-two cards into six 
categories on a scale running from most descriptive of himself to least 
descriptive. The first and sixth categories are to contain one statement in each; 
the second and fifth categories are to contain five statements; the third and 
fourth categories are to contain ten statements. After the sorting is completed, 
the psychologist records the placement of each statement and then has the 
client repeat the task, sorting on this second occasion into six categories 
running from most descriptive to least descriptive of the person he would, 
ideally, like to be. Again the thirty-two placements are recorded. Each state- 
ment now has two placements. If every statement is given the same placement 
on both occasions, there is no discrepancy between the subject's two descrip- 
tions and congruence is maximal. On the other hand, much inconsistency in 
the placements indicates that the two descriptions are quite discrepant and 
that congruence is low. Again, as in the two previous examples, the place- 
ments from the first sorting can be plotted against the placements from the 
second. If the placements in the two sortings are identical, then all thirty-two 
points will be located on a straight line, the discrepancy will be zero and the 
congruence will be perfect. A considerable degree of scatter in the points 
would indicate a considerable degree of discrepancy in the two descriptions 
and very low congruence. Congruence, or the lack of discrepancy, can be 
measured by computing a coefficient of correlation for the two sets of 


placements. 

We have now completed ou 
purpose of this account was to 
and present measurement practi 


r account of psychological measurement. The 
provide the student with a brief review of past 
ces. It is hoped that the review will give him 
a general idea of the nature and scope of the background from which the 
psychologist can obtain ideas for measures which he can employ in his 
research. Confronted with the need for a scale, he can, and perhaps should, 
scan the whole field of psychological measurement. Is there a possibility that 
some physical measure of response would be appropriate? Can one arrange 
the task of the subject so that a threshold measure can be obtained? Could 
one specify a class of responses which can be identified and counted? Can 
judgments be employed to any advantage? Do sorting tasks offer any good 
possibilities ? 

Suggesting that a psychologist should deliberately survey the entire 
measurement scene when he is undertaking research on a given problem is 
merely to emphasize that he should consider the full range of possibilities, 
that he should take advantage of what has already been done, and that he 
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should try to relate his work to the work of others. If he does so, he will 
sometimes simply adopt the specific measuring devices which someone else 
has employed. Frequently he will find it necessary to modify or adapt the 
techniques or instruments of others and may, in this way, make an original 
contribution to the field of measurement. Occasionally his survey may yield 
him nothing suitable for his purposes and he may have to engage in extensive 
research on the measurement problem itself. 

What we have called imitation is the largest source of ideas for measurement 
in research, but it is not the only one. There is always the possibility that 
some essentially new way of measuring behavior will be invented or dis- 
covered, will be investigated and confirmed, and will then take its place in 
the body of available measurement practices. We have already mentioned the 
two sources of new ideas. They are intuition and theory. 

Naming intuition as a source of ideas for new measures is simply recog- 
nizing the fact that the source of original ideas, whatever they may concern, 
can seldom, if ever, be determined in an objective fashion or with any degree 
of certainty. A man reads widely on some topic or problem, discusses it with 
learned colleagues, thinks about it intensely over a long period of time, and 
suddenly begins to talk about it as no one else has. Is his idea really new and 
entirely original? That is doubtful, but it may be so regarded. Thus it is that 
the invention of calculus is credited to Newton—and to Leibniz. In psycho- 
logy, it may very well be that everyday experience, as well as study and 
research, plays a significant part in the conception of new ideas about 
measurement. 

Finally, the important place of theory in psychological measurement must 
be given proper recognition and emphasis. Substantial bodies of theory have 
been formulated in at least four areas of measurement: the testing of achieve- 
ment, ability, and aptitude; the assessing of attitudes; the determination of 
thresholds in psychophysics; and the scaling of absolute and comparative 
judgments. Detailed accounts of particular theories in these or other areas 
are not germane to the purpose of the present discussion. What is relevant to 
that purpose is a consideration, in general terms, of the nature of theory and 
its relation to confirmation. 

A measurement theory is an attempt to give an explanatory account or a 
systematic view of the quantification of observations of some characteristic 
of behavior. The theory is given expression in a statement of general and 
abstract principles, but there is, almost always, a body of facts on which the 
principles have been based. It may be possible to deduce hypotheses from 
the theory and to test these hypotheses by devising tasks for subjects to 
perform, methods of observing the subjects" behavior, and rules for quantify- 
ing the observations. There are two principal points of difficulty in this 
process: the deduction of hypotheses and the devising of tasks. In its highest 
form, the deductive process is mathematical and therefore absolutely rigorous. 
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In its lowest form, it might be no more than a loose sequence of plausible 
statements, in which case it is lacking in rigor and is usually the cause of 
disagreement, argument, and controversy. Translating the terms of hypotheses 
and theory into the concrete operations of measurement is always accom- 
panied by some degree of awkwardness and uncertainty. It is seldom, if ever, 
obvious as to just how this translation can be properly effected. The gap 
between the terms of the theory and the corresponding operations of 
measurement is disturbingly large. 

Theory is, of course, very important as a vehicle for original thinking about 
measurement and as a source of inventions, discoveries, new approaches, 
modifications, and elaborations, but one point, concerning what theory 
cannot in itself accomplish, should be clear: Theorizing does not itself consti- 
tute confirmation. Theorizing before or during the process of scale construc- 
tion may make confirmation more likely when confirmation is attempted, but 
there is no law that says it will do so in every endeavor. Everything depends 
on the quality of the theorizing and, ultimately, its relevance to the behavior 
in question. There are no guarantees of either quality or relevance. 


Summary 


Two approaches to confirmation were described. The structural approach 
requires the demonstration of correspondence, point by point, between pro- 
perties of numbers and properties of behavior. The functional approach 
requires the demonstration of distribution, reproducibility, and predictive- 

lace in the devising of new scales, but 


ness. Measurement theory has its p 
theorizing does not itself constitute confirmation. Confirmation is empirical; 
it cannot be deduced. 


it requires demonstration and observation; 
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Psychological data may, on occasion, take the simple form of a single set 
of measures which exhibit variability. The set may consist of repeated 
observations of the behavior of one subject, or it may consist of measures on 
a group of subjects, each of whom has been measured on one occasion. 

The psychologist may wish to describe the set of observations; that is, he 
may wish to communicate to his colleagues, his employer, the public, or his 
students, information concerning the nature of his observations. He can, of 
course, provide a very complete and exact description by simply listing all of 
the measures. While this kind of description is complete and, in a sense, exact, 
it suffers from certain definite limitations. Although it might be quite satis- 
factory to list the measures if there were only a few, it would not be at all 
satisfactory to do so if there were many. If the number of observations was 
large, it would be difficult or impossible for the investigator’s audience or 
reader to form an adequate impression concerning any of their character- 
istics and the salient features of the set of observations might not be apparent 
at all. Regularities in the data might be completely obscured by the mass of 
data and their irregularities. 

To provide the student with some experience of the difficulties inherent in 
trying to grasp the main features of a fairly large number of observations, 
we have listed in Table 6.1 scores on a test. How many scores are there? 
What are the highest and lowest values? Can you determine from an inspec- 
tion any other characteristics of the data? 

Instead of listing all of the observations, the psychologist might classify 
them in a frequency distribution and represent the distribution graphically 


by means of a histogram. The frequency distribution and the histogram 
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together provide an exact representation of the data in a form, some features 
of which are quickly and easily comprehended by the viewer. The person 


Table 6.1 
A LISTING OF TEST SCORES 
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Note: Columns and rows have no classificatory significance in the table; they represent 


simply an arbitrary arrangement of the data. 


histogram can readily determine the 


examining the distribution and its 
highest and lowest values and the points of heavy and light concentrations of 


values; he can also form an impression of the shape of the distribution: its 
symmetry or lack of symmetry, and its flatness or peakedness. Table 6.2 
contains a frequency distribution for the test scores in Table 6.1. Figure 6.1 
is a histogram for that frequency distribution. 

In theory, there is another way of providing a complete and exact 
description of a set of measures, but, in practice, it is almost always either 
very difficult or impossible. A complete description of the data might be 
given in terms of moments. The difficulty is that a large number of moments 
might be required. When the set of measures covers, in its range, 7 distinct 
score intervals, then r moments are required for a complete description. The 
r moments may include the zeroth and all others up to the (r — 1)" moment. 


116 Describing Data by Means of Moments 
Table 6.2 


A FREQUENCY DISTRIBUTION OF 
TEST SCORES FROM TABLE 6.1 


x f 
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8 10 
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11 108 

12 78 

13 38 

14 14 

15 3 


N — 384 — total number of scores. 
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Fig. 6.1 A histogram representing the frequency distribution of test scores in Table 6.2. 


Given a set of measures with five distinguishable score intervals in its 
entire range, a complete description would be provided by N, Mz, Vz, S, and 
K. That the distribution is fixed or determined when there are five score 
intervals and five moments are specified can be proved, but the proof will 
not be given here. If the distribution has six score intervals, these same five 
values will not completely describe it. That is to say, there are many different 
six-interval distributions having the same values for the four descriptive 
moments and N. The example of skewness when S = 0, which was given in 
Figure 4.5, is explained by this fact. 

The frequency distribution of Table 6.2 and the histogram of Figure 6.1 
have nine score-intervals. The value of N and eight moments would be 
required for complete description of the test scores on which the histogram 
and the frequency distribution were based. It would be quite impractical to 
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compute the higher moments, which would involve the fifth, sixth, seventh, 
and eighth powers of the variable. 

. Since complete description is often either not desirable or not practical, it 
is reasonable to consider the possibilities of partial description or sum- 
marization. The summarizing of data can be accomplished so that it provides 
information concerning, and directs attention to, the salient features of a 
collection of measures. Summarizing is computationally feasible and eco- 
nomical. It may satisfy the requirements and the obligations of the investigator 
to provide a limited summary, in which case the information communicated 
may be no more than a crude approximation to the total information con- 
tained in the data, or an extensive summary, in which case the information 
communicated may be, for all practical purposes, a very good approximation. 

Partial description or summarizing requires that the interpreter of data 
make a choice of a few moments out of a larger number, a number which 
could in fact be very large. The interpreter's choices are almost always 
limited to the first four descriptive moments, on which our attention was 
focused in Chapter 4. The choices are limited to these four moments not only 
because of the computational difficulty and cost in using the higher moments, 
but also because our understanding of, and familiarity with, the higher 
moments have not developed very far. The availability of high-speed, elec- 
tronic computers will, perhaps, change this situation and, in the future, the 
higher moments may be used more extensively. 

Limiting one's choice of moments to the first four does not solve all of the 
problems connected with that choice. As a matter of fact, the difficult 
features of the problem of choosing descriptive moments remain to be con- 
sidered. The difficult features of the problem have to do with the relation 
between the logical properties of a particular moment and the interpretation 
intended by the investigator. Developing an understanding of the relevance 
of particular moments to particular practical questions or notions is an 
extremely difficult task for the student. It is important to cultivate this under- 
Standing and not to rely on imitation, even the imitation of recognized 
authorities. Imitation without understanding is often based on superficialities 
and often leads to incorrect use of statistics. 

It is our opinion that an adequate working conceptualization with respect 
to a particular moment requires a thorough familiarity with, and a kind of 
synthesis of, the logical properties of that moment, in ready association with 
a class of practical situations and problems, which vary but have certain 
features in common. 

We propose discussing below, in somewhat general terms, the various kinds 
of situations and problems commonly encountered by the psychologist and 
the appropriate choices of one or more moments for purposes of description. 
Before beginning the discussion, however, we must emphasize one point: At 
present we will not give consideration to error or random variation as they 
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are defined statistically. We will be discussing values which are properly 
classified as d-statistics and not i-statistics. In Chapter 12, consideration will 
be given to i-statistics and logical properties having to do with error will be 
the focus of attention. 

If a very limited description of a set of measures is to be given in the form 
of a single d-statistic, the choice will probably, but not necessarily, be the 
mean. The mean conveys information about the magnitude of the measures, 
but, of course, only in a general sense. The recipient of the information should 
know that the measures vary around the mean and that they lie in a region 
of the scale roughly indicated or approximated by the value of the mean. He 
should also realize that knowing only the mean leaves him without any other 
information. An assumption regarding other characteristics of the data would 
be risky. For example, he knows nothing of the magnitude of differences 
among the measures. 

The mean of the test scores in Table 6.1 and the distribution in Table 6.2 
is 11. Reflect, for a moment, on the fact that the mean is 11, and consider 
how much, or how little, one knows about the scores. Examine again the 
distribution in Table 6.2 and try to specify some of the properties of the 
distribution not revealed by the value of the mean. 

Somewhat more completeness of description is achieved when two 
moments, the mean and the variance (or the standard deviation) are com- 
puted. The mean gives the general level of performance; the variance indicates 
the general magnitude of differences among the measures. It is a common- 
place now to present the mean and variance of a distribution of observations 
and, in the use of moments, this practice occurs with far greater frequency 
than any other. 

The variance of the distribution of scores in Table 6.2 is 2. Then S; = «2. 
We have already indicated that the mean is 11. Recall that the variance is 
the average of the squares of deviations from the mean. It follows that, for 
the squared deviations, the variance is an interjacent value. That is, there are 
some squared deviations greater than 2 and some less than 2, if they vary at 
all. We also know that some of the original deviations were positive and 
some, negative. Knowing the mean and the variance gives one some crude 
notion of the general magnitude and the dispersion of the scores, but 
certainly does not give a clear or exact picture. 

Computation of the four descriptive moments is probably the most 
complete description which may, at the present time, be said to be practical. 
It should be understood, however, that the distribution is not fixed in all 
respects by determination of the four values: M,, Va S, and K. Although 
a number of moments equal to the number of score intervals would be 
required for a complete description, it is our opinion that distributions of 
psychological measures are quite adequately described by the first four 
moments and that variation in features not fixed by these values can, in 
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practice, be disregarded. In other words, we are saying that the first four 
moments will take account of the important regularities or salient features 
in psychological data and that the irregularities or details not taken account 
of can be ignored. This opinion should not be mistaken for a statement of fact. 
In the future, experience may indicate the desirability and even the necessity 
for going on to additional, higher moments. 

The index of skewness for the distribution in Table 6.2 is +0.177. The 
index of kurtosis is 2.875. Thus the first four descriptive moments can be 
summarized as follows: 


M, =U), 
KB = 2; 
S = 0.177, 
K = 2.875. 


The distribution is positively skew, but only moderately so. In the range of 
values for K commonly encountered in practice, the obtained value is not 
extreme; it lies well within the range of moderate values of K. 

Description of the distribution in Table 6.2 in terms of four moments is 


x 


Fig. 6.2 A five-interval histogram for a distribution completely described by the four 
descriptive moments: Mz = 11, Vo 25 = 0.177, and K = 2.875. N = 384. 


only approximate; it is not complete, as would be a description based on 
nine moments. This approximate description might, however, be con- 
ceptualized in terms of a distribution which had fewer intervals, but could 
be described exactly by four moments. For example, the regularities of the 
original nine-interval distribution of Table 6.2, as these regularities are por- 
trayed by the four descriptive moments, might be thought of as the character- 
istics of a five-interval distribution which is completely described by those 
moments and the zeroth moment or N. The original, nine-interval distribu- 
tion of Table 6.2 has the following moments and total frequency: M; = 11, 
Va = 2, S = 0177, K = 2.875; and N = 384. These values describe the 
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original distribution inexactly, but they provide an exact description of the 
distribution represented by the histogram in Figure 6.2. One might, therefore, 
conceptualize this description of Figure 6.1 in terms of Figure 6.2. 


Measures on a Single Subject 


In the preceding section, a distribution of test scores for 384 subjects was 
described through the use of moments. We shall consider next an example 
involving the description of 48 measures on a single subject. 

A psychologist measured the reaction time of a single subject on presenta- 
tion of an auditory stimulus on 48 trials. Measures were recorded in twenty- 
fifths of a second. The 48 measures are listed in Table 6.3. The frequency 
distribution is given in Table 6.4. The histogram is shown in Figure 6.3. 


Table 6.3 
MEASURES OF REACTION TIME FOR A SINGLE SUBJECT 


UA CA 4 CA CA RY 
UARADAUNA 
ARUDA 
PRRIUUARUD 


MAPARUAWA 
Ov ta 4& d CA S3 4S CA 


Each observation is expressed in twenty-fifths of a second. Columns and rows have no 
classificatory significance; the arrangement is arbitrary. 


Table 6.4 


A FREQUENCY DISTRIBUTION OF MEASURES 
OF REACTION TIME 


x d 
3 1 
4 16 
5 18 
6 8 
7 5 


Original data given in Table 6.3. X — Measure of reaction time in twenty-fifths of a 
second. 


The four descriptive moments have been computed and have the following 


values: 
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20 


ion of reaction time measures 


Fig. 6.3 A histogram representing the frequency distributi 
— reaction time measured in 


in Table 6.4. M, = 5, V, = 1, S = 0.50, and K = 2.5. x 
twenty-fifths of a second. 


Mz = 5, 
V, =1, 
S = 0.50, 
K = 2.50. 


bution is positively skew, since one would 


It is not surprising that the distri ) 
t on reaction time, a limit imposed possibly 


expect the existence of a lower limi 
by physiological and neurological factors. 


Some Issues in Description 


ect or a group of subjects, by presenting the 
ect with another or one group with another 


by presenting the difference between two means, interest is often directed to 
one end of the scale or the other. The investigator may wish to point out that 


the individual or group is high or low on the scale, as the case may be. When 
ish to point out that one individual or 


he is making a comparison, he may WI 
one group is higher or lower than another individual or group. There are 
Occasions, however, when interest is directed to the relation between the 
mean and some important reference point. For example, the measures might 
be positive and negative in value, representing deviations above and below 
some true value or known standard. 

The choice of the mean as the only descriptive value to be presented is 
Sometimes justified ; sometimes, it is not. When the variation around the 
mean is judged to be the result of measurement errors or momentary and 
transitory fluctuations in response, as might well be the case in certain kinds 


of repeated measurements on à single subject. then computing a mean may 


In characterizing a single subj 
mean, or in comparing one subj 
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be looked upon as eliminating error, that is, as removing variability which 
only obscures the true nature of the performance and which properly should 
be removed. An example would be establishing a threshold for visual acuity 
by repeated determinations under the same conditions. Variation in the values 
obtained might be considered error and might, therefore, be eliminated by 
computing a mean. 

When individuals or groups are known to have approximately equal 
variances and higher moments, the presentation of means may be all that is 
required for a comparison. Since, under these conditions, the individuals or 
groups being compared would be alike in all respects save the one on which 
the comparison is to be made, the interpretation could be made straight- 
forward and unambiguous. Any difference resulting from the comparison 
would be properly characterized as having to do with this single attribute as 
represented by mean values and with no other attributes. 

In choosing the mean as a descriptive value over other values, it is possible, 
of course, to take an arbitrary position by saying that one's interest in the 
evaluation is defined as having to do with means. The inadequacy of this 
arbitrary position may be revealed, however, when an interpretation is made 
which implies the need for more information than is contained in the means. 
Here are two examples of inadequate interpretations resulting from the use 
of means alone. 

1. Two men are tested repeatedly and then compared with respect to their 
aptitude for handling certain heavy construction equipment. One individual 
has a higher mean score than the other, thereby appearing to be the more 
skillful of the two, but further examination of the data reveals that this 
individual is much more varied in his performances than is the other and that 
some of his scores are low enough to imply serious consequences for his safety 
and the safety of others in the real-life situation. An intelligent choice between 
the two individuals could not be made without information as to the 
variability of their performances. 

2. The freshman class of each of two large universities was administered 
a standard college admissions test. The two means were computed and com- 
pared. The report was circulated that the one class was superior to the other, 
thereby leaving the impression that the one university had a distinctly better 
freshman class than did the other. Further examination of the data would 
have revealed that the difference between the two means was very small when 
one considered the variability within each class. The overlap in the two 
distributions was quite large and, in the light of this fact, the small difference 
between means could reasonably be considered unimportant. 

Before leaving the discussion of the mean, we should point out that the 
sum from which the mean is computed is often as useful as the mean itself 
in indicating the general magnitude of a set of measures. Means have the 
advantage over sums in that the former can be compared when the number 
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of observations on which they are based varies. Sums can only be compared 
when the numbers of observations are equal, since a sum is directly a function 
of the number of values as well as their magnitude. 

When variances may be unequal but means and the higher moments are 
equal, it is defensible to limit the comparison of individuals or groups to a 
comparison of variances. A larger variance indicates greater differences 
among the responses of an individual or the responses of a group than the 
differences for the other individual or group. It is interesting to note that, 
instead of variances, sums of squares of deviations might be used for com- 
parison when N’s are equal. As was the case with means in relation to sums, 
variances can be compared when N "s are not equal whereas sums of squares 


of deviations cannot be compared. 

In comparing individuals or groups wit 
variances, a variety of interpretations is possible depending on the particular 
combination of values and the practical situation. When means are equal and 
variances are equal, or judged to be approximately so, the comparison offers 
no difficulties of interpretation. The conclusion that the individuals or the 
groups are alike is defensible unless there is some reason to believe that they 
differ with respect to higher moments. As has already been indicated, when 
means are unequal and variances as well as the higher moments are equal, 
it is reasonable to base one’s interpretation on a comparison of means; when 
variances are unequal but means as well as the higher moments are equal, 
Variances may be compared and interpreted. Values for means and variances 
may, on occasion, suggest that there is little or no overlap in the two sets of 
measures and that there is, consequently, a fairly substantial basis for the 
interpretation that one set of performances is superior to the other. The 
question of overlap can, of course, be answered much more conclusively 
by examination of the two frequency distributions or their histograms, and 


their overlap. ; 

With certain combinations of unequal means and unequal variances, an 
interpretation which goes beyond the statement of the several values may be 
unwarranted, Students should be aware that only those interpretations whose 
appropriateness is obvious, or very nearly so, or which can be reduced to 
terms in which they become obvious and command assent, are worthwhile. 
When an interpretation cannot be stated so that its correctness is obvious, 
Persisting in that interpretation only provokes unfruitful debate and 


argument. 


h respect to both means and 


Test Construction 


e employed extensively in the construction and 


The descriptive moments ar ) 
distribution of test scores can be given a 


Tevision of psychological tests. A 
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partial description by computation of its mean, variance, index of skewness, 
and index of kurtosis. It is possible to express each of these moments in terms 
of certain characteristics of the items in the test so that it becomes theoretically 
possible to select items to produce a test whose score distribution will have the 
desired moments. What we are suggesting is that the distribution of test 
scores can be modified and determined through the selection of items for the 
test. On occasion we may wish to produce any one of a large number of 
distributions of varied form. 

We may wish to produce a binomial distribution for which the number of 
score intervals is (n + 1) and the frequencies are given by the successive terms 
in the expansion of (1 + 1)", where 2" = N. A distribution of this type is 
symmetrical and unimodal, with smaller frequencies in the tails than in the 
center. As n becomes larger, the value of the index of kurtosis for the binomial 
distribution approaches three and the function approaches the normal curve, 
an important continuous function which will be discussed in some detail, 
later. Three examples of binomial distributions are given in Table 6.5. 


Table 6.5 
EXAMPLES OF BINOMIAL DISTRIBUTIONS 

X f X f 

n= 2: n= 10; 
os = "y. P = 10) 
0 1 0 1 
1 2 1 10 
2 1 2 45 
X £ 3 120 
m = t) 4 210 
N = 16 5 252 
0 1 6 210 
1 4 7 120 
2 6 8 45 
3 4 9 10 
4 1 10 1 


It is conceivable that we might sometime wish to produce a test which 
distributed respondents uniformly, or approximately so, over the score 
intervals as in a rectangular distribution, or one which differentiated between 
two parts of the distribution. Table 6.6. gives two examples: a rectangular 
distribution and a distribution in which the upper 15 per cent of respondents 
is clearly differentiated from the lower 85 per cent. 

In psychological testing, it is a common practice to assign a score of zero 
to the failing of a single test item and a score of one to the passing of a single 
item. Consequently, scores of a group of individuals on a single item may 
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Table 6.6 


DISTRIBUTIONS PROVIDING TWO 
KINDS OF DIFFERENTIATION 


X f Xx F 
(N = 50) (N = 48) 
0 10 0 3 
1 10 1 8 
2 10 2 30 
3 10 3 0 
4 10 4 7 


o moments only, the zeroth and 


vary, taking on values of zero and one. Tw 
ly a distribution of item scores. 


the first, are required to describe complete 
An example of a distribution of item scores is given below. 


I f 
0 60 
1 40 


In the example, M = 100 and the mean, M: = Sfl/N = (60 x 0 + 40 
portion of subjects 


X 1)/100 = 0.40. Notice that M; turns out to be p, the pro 
passing the item. V; the item variance, can be computed from the mean by 
the formula: 
y, = M(1 — Mà 
Of course, S; = y Vi. 
If we let M, — p = the proportion of subj 
(1 — M,) = q = the proportion failing the item, then 


ects passing the item, and 


Vi — PY 
and á 
S; — „pa. 


Describing Populations 


in the discussion of statistical inference and 


sampling distributions, the four descriptive moments are also employed in 
characterizing what are called population distributions. Population distribu- 
tions are the parent distributions from which random samples are obtained. 


We shall defer further comment on this topic. 


As we shall see later on, 


The Limitations of Description 


Even when a complete description of the data at hand is possible, research 
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psychologists are seldom satisfied with this kind of analysis and evaluation. 
In describing data, one ignores error but must admit that the original 
measures and any values computed from them were subject to uncontrolled 
influences or biases, which one would like to take account of, in a more 
rigorous fashion. If there was any selection of subjects, if the occasions of 
measurement were chosen in some arbitrary manner, if there were any 
environmental influences not intended by the investigator and not strictly 
controlled by him—and these conditions are almost inevitable in any psycho- 
logical study—then the presence of error and the possibility of bias in the 
data can be taken for granted. 

Because of the presence of biasing errors in his research, a psychologist 
prefers to analyze his data by methods which make it possible to estimate the 
magnitude of errors. It is conceivable that he could, on the basis of his own 
prior experience and the reported experience of others, make a judgment as 
to the size of errors and the possible size of the bias, and could reach con- 
clusions which, even though the steps in the judging process cannot be made 
explicit, and even though the process is private and lacking in standardization, 
would be correct and could be defended in the scientific community. Never- 
theless, there are far better methods and techniques available for estimating 
error and bias, and for making decisions about research data. We are referring 
to the procedures of statistical inference. 

In saying that statistical inference provides the best methods for taking 
account of error and bias, we must add a crucial qualification. Statistical 
inference, when it is correctly and appropriately applied, is superior to 
description or any other kind of evaluation, but when it is incorrectly and 
inappropriately used, it is inferior to a descriptive evaluation which has been 
supplemented by good judgment. That is to say, there are situations in which 
statistical inference ought not to be used and in which the best choice for an 
evaluation is simple, straightforward description. The consequence of an 
indiscriminate application of statistical inference is a failure in the attempted 
inference and a misrepresentation on the part of the researcher. Far too 
frequently, students come to believe that making a choice between inference 
and description is no more than exercising a preference for a sophisticated 
method over an unsophisticated one. Their position implies that no know- 
ledgeable researcher would ever limit himself to description. We cannot agree 
with a position which results, as it so clearly does in much of the published 
literature, in the misuse of statistical inference. There is no rigor in statistical 
inference when it is misapplied and, what is worse, the reader of a research 
report may be misled as to the nature and quality of the data behind the 
report by the presence of these elegant inferential techniques. 

It is our viewpoint that research should be planned, whenever it is possible 
to do so, to permit an analysis and an evaluation by the very best methods, 
which are undoubtedly those of statistical inference. It is also our viewpoint 
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that certain kinds of research data which are available to psychologists and 
educators are important and should be reported in their journals, but are 
suitable only for description plus an interpretation based on experienced 
judgment. To sum up, humble data deserve an unpretentious analysis and a 
simple, honest report. 

Statistical inference will be 
and 14. 


discussed at length in Chapters Il. 12; 13, 


PRODUCT MOMENTS 


Syntactics | 7 


Product moments constitute a class of values which can be computed from 
two or more sets of numbers when the following conditions are satisfied: 
there are N numbers in each set; there are N classes or categories consisting 
of one number from each set. The product moments of interest to us here 
are those computed from two sets of paired numbers. 

We let X; be the ith number in one set and Y; be the it" number in a second 
set, where i = 1, 2, 3,..., N. The pairing of two values, one from each 
set, is indicated by the common subscript, i. The (2N) numbers can be 
classified in the rectangular arrangement shown in Table 7.1. Each column 
in the Table represents a set of numbers; each row is a pair of values. 


Table 7.1 
PAIRED VALUES OF X AND Y 


Columns 
X E 
1 X Y, 
2 Xa Y: 
3 X; Y; 
Rows s 
i Xi Y; 
N XN Yn 


To compute a product moment for X and Y, we raise each X value to 
some power and each Y value to the same or a different power, obtain the 
product of the two powers for each pair of numbers, sum these N products, 
and divide the sum by N. A particular product moment is identified by the 
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two exponents employed, that is, by the power of X and the power of Y. The 
general formula for a product moment of X; and Y,—let us call it the mt? 
and nt product moment—can be written as 

EXPY? 

N 
It will be convenient to dispense with the subscripts and write simply 
Ax" Y^ 
d 


The Product Moment of First Powers of X and Y 


We shall be concerned here with the special case of product moments in 
Which m = n = 1. When both exponents are ^ one," the general formula 
becomes ; 
ZXY, 

N 

Computation of the product moment of first powers of X and Y involves 
summing the products of X and Y, and dividing the resulting sum by N. It 
may be helpful to refer these operations to the paired values of X and 
Y in Table 7.1. 

As a matter of convenience, we shall use the term “ product moment " to 
refer to the special case involving first powers of two variables, but it should 
be understood that the term has a more general definition. 


The Product Moment of v and y 


X — M, and Y to y by the 


Let us transform X to x by the formula, x — 
t of x and y is called the 


formula, y — Y — M,. The product momen 1 
covariance. It will be represented by the symbol, V... That is, 
Axy 
Vev= N (7.1) 


The Product Moment of Zx and Zy 


Consider x transformed to Z, by the formula, Z, = x/S,, and y trans- 
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formed to Z, by the formula, Z, = y/S,. The product moment of Z, and Z, 
is an important value, the product-moment coefficient of correlation or Fry- 
The formula is 


_ 22,2, 

Try = T ue 

The product moment of first powers of X and Y appears in certain com- 

puting formulas, but has no particularly important properties. The logical 

properties of the covariance, V,,, and the coefficient of correlation, ray, will 

be discussed next. In this initial discussion, no attention will be given to 
properties having to do with error or random variation. 


(7.2) 


Logical Properties of the Covariance 


By definition, 
2, 
N 


The numerator can be written to make explicit the contributions of the four 
possible combinations of positive and negative deviations of x and y. If we let 


Vey 


X, = positive X deviation, 
x. = negative X deviation, 
y. = positive Y deviation, 
y- = negative Y deviation, 


f, = frequency of pairings of positive deviations of X and positive 
deviations of Y, 

f,- = frequency of pairings of positive X deviations and negative Y 
deviations, 

f = frequency of pairings of negative X deviations and positive Y 
deviations, and : 

f = frequency of pairings of negative X deviations and negative Y 
deviations, then the numerator, Zxy, in which there are N terms, can be 
written 

Exy = Ex, ya + Lxyy_ + Exc Exy-. 

The number of terms in each of the four sums on the right is equal to the 
frequency with which those particular pairings of deviations occur. The first 
and fourth sums on the right will always be positive or zero. The second and 
third sums will always be negative or zero. Thus Z'xy and V,., will be positive 
when the absolute value of the sum of the positive products is greater than 
the absolute value of the sum of negative products; that is, when 


\(2x py, + Ex y2| > \(2x,y_- + Zx-y+) l. 


Sxy and V,, will be negative when 


x 


Bivariate Frequency Distributions 131 
xy. + Zxy)|« \(2x,y_ + 2x ys) n 
that is, when the absolute value of the sum of positive products is less than 
the absolute value of the sum of negative products. £xy and V,, will be 
zero when the absolute values of the two sums are equal, that is, when 
(Sxpy4 + Sxy)| = (Zx + 2x74) |: 
Notice that the absolute values depend not only on the magnitudes of 
deviations, but on the frequency with which the pairings occur as well. 


Bivariate Frequency Distributions 


Given two sets of paired numbers, X, and Y; the pairs can be doubly 
classified according to their values as shown in Table 7.2. The rectangular 
classification is called a bivariate frequency distribution. 


Table 7.2 
A BIVARIATE FREQUENCY DISTRIBUTION 


Xs 


of a bivariate frequency distribution might be 
dimensional figure having two horizontal axes, 
a single vertical axis for frequency. Instead of 
the two-dimensional rectangles employed in a histogram, we would have 
three-dimensional rectangular solids whose height would represent frequency. 

Although three dimensions are implied by bivariate frequency distributions, 
fine the figure to two dimensions. To do 
the coordinates of a point in the 
ts. The points may form any one 
ften scattered throughout the 


A graphical representation 
achieved by means of a three- 
one for X and one for Y, and 


it is usually more convenient to con 
so, one takes the pair of X and Y values as 
two-dimensional space and plots all N poin 
of a great variety of configurations, but are o 
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space, a circumstance which accounts for the designation of the figure as a 
scatter plot. 

Original values of X and Y can be used for plotting purposes, but it is 
instructive to transform each variable to deviations and to plot pairs of 
deviations, x and y. In Table 7.3 are 10 pairs of values of x and y. These 
deviations have been used as coordinates for the plotting of the 10 points 
in Figure 7.1. 

Table 7.3 
VALUES OF X AND Y 
TRANSFORMED TO DEVIATIONS 


(First Example) 


x y 

1 3 4 

2 1 0 

3 3 3 

4 2 1 

y 5 —3 —3 
Pairs 6 0 2 
7 —4 —3 

8 —3 —2 

9 —1 —2 

10 2 0 


The covariance, V,,, is computed from the deviations in Table 7.3 by 
summing their products and dividing by N — 10. Thus 


Zxp §2 
Vae ay gg nra 
Y 
Il (-,4) | 1 (e) 
| "X 
-5 +5 
IM (--) e IV(4.-) 


Fig. 7.4 Plot of points whose coordinates are the paired deviations in Table 7.3. 


The coordinate axes of Figure 7.1 define four quadrants which have been 
labeled in conventional fashion with Roman numerals. Beside each Roman 
numeral is the sign combination characteristic of the coordinates of points 
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in that quadrant. In the preceding section on logical properties of the co- 
variance, f... f... f and f__ were the frequencies for the four possible sign 
combinations arising from the pairing of x and y deviations. These four 
frequencies can be referred to Figure 7.1 and interpreted as the number of 
points plotted in each quadrant. In this particular example, there are no 
combinations of positive x's and negative y's, and no combinations of nega- 
tive x's and positive y’s. Consequently, although the points scatter somewhat, 
all of the points lie within or on the bounds of the first and third quadrants, 
and the trend is unmistakable. 

Another bivariate distribution is given in Table 7.4 and the points for the 
paired deviations are plotted in Figure 7.2. Examination of the distribution 
will reveal that the x and y deviations of Table 7.4 are the same as those of 
Table 7.3. Although the deviations are the same, the pairing is different. 
Consequently, the covariance is different and the plot of Figure 7.2 is different 
from that of Figure 7.1. Vz, for Table 7.4 is zero. 


Table 7.4 


VALUES OF X AND Y 
TRANSFORMED TO DEVIATIONS 


(Second Example) 


UNU = w 
| 


Pairs 


| 
I 
| 
KH ONWNANVON 


SeCmranuaune 
o 
| 


N 
| 
o 


-5 +5 


x 


Fig. 7.2 Plot of points whose coordinates are the paired deviations in Table 7.4. 
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A Computing Formula for the Covariance 


It is, of course, possible to compute V,, by transforming all the original 
X and Y values to deviations from the mean, obtaining the sum of the pro- 
ducts, and dividing by N. It is usually more convenient, however, to employ 
a formula expressing V, in terms of original values, X and Y. The derivation 
of such a formula is given below. 

By definition, 


2X, 
Va =H 


Consider the numerator, £xy. By definition, 
Zxy = 2(X — MjXY — My). 
Multiplication of the two binomials yields 
Sxy = (XY — M,Y — M,X + M,M,). 

Simplification produces 

Axy = EXY — M,XY — M,ZX + NM,M,,. 
Substitution of 2 X/N for M, and X Y/N for M, gives 
ZXZ£ZY ZXY , ZXZY 


oxy ZXY N N N 
Combining terms on the right yields the final computing formula, 
ZXZY 
Zxy = Axy — UN (7.3) 


Formula (7.3) is convenient for computing 2xy, the sum of products of 
deviations, from which V,,,, the covariance, can be obtained by dividing by N. 


Prediction Coefficients 


We shall define two coefficients of prediction or prediction constants, Dyz 
and bay as follows: 


V. 
By Y. s (7.4) 
and 


Vey 
Bey = “Tn, (7.5) 


These values are also called regression coefficients. 
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Notice that each prediction coefficient is the ratio of the covariance to one 
of the variances. A prediction coefficient is a constant value which can be 
employed, in an equation, to predict variability in one set of numbers from 
variability in the other. 

The constant, bys, is derived, as follows: Given paired values of x and y, 
we wish to find a value for the constant, bys, such that we can compute a 
value, y’, for each value of x by the formula, 

y= bysX. 
The computed value, y’, is called a predicted deviation. We wish to make the 
best possible predictions, that is, we intend that the values of y’ will, in some 
sense, be as nearly equal to the actual values of y as we can manage to have 
them be. With some sets of paired numbers, the value of the desired constant 
might be determined quickly, by inspection, as in the example in Table 7.5, 


Where it is apparent that, when by: = 3, the predicted values, y' = 3x, are 
identical with the actual values of y. That is, y' — y in each prediction. 


Table 7.5 


PAIRED DEVIATIONS WHICH 
PERMIT PERFECT PREDICTION 


y y y = 3x 
1 —3 —9 —9 
2 —1 —3 —3 
Pairs 3 0 0 0 
4 2 6 6 
5 2 6 6 


Note that by; = 3. 


nnot be chosen so easily and with such 
the paired values of Table 7.3 and also 
that there is no value for bys which 
d values exactly equal to the given 
gned to by, there will be errors in 
f prediction as the difference 


Usually the value of the constant ca 
perfection in the predictions. Examine 
those of Table 7.4. It should be apparent 
will permit the computation of predicte 
values of y. No matter what value is assi 
the predictions. We shall define an error 0 
between y and y’; that is, as 

O —») = 0 — brx). 
o choose a value for bys which will make the 
d, exactly, in what sense prediction can 
oked to define “ best ” is that the sum 
Il be a minimum. In applying the 


3 Keep in mind that we wish t 
* best ” prediction. We have not sai 
be the best. A criterion commonly inv 
of squares of errors of prediction sha 
Criterion, it is the quantity, 

Zy — yy = 29 — brx), 
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which is to be minimized. The expression can be written as 
A2(y? — 2b,,xy + bjx?) = Ly? — 2b,, Exy + b; LX. 
Reordering the three terms on the right gives 
b? 2x? — 2b,,Zxy + Xy. 


If we multiply the expression by Zx? and then indicate division by Zx?, we 
obtain 
bsp — 2b, Exe + Ext Sy" 

Ax* 
which has the same value as the original expression. We can also add and 
subtract the same quantity, ( Zxy)?, in the numerator without changing the 
value of the expression. Adding and subtracting (2xy)* yields 

bi (Ext) — 2b,,2xyZx* + (Zxyf — (xy? + BP Ly? 

25 
The first three terms of the numerator constitute the square of a binomial 
and the numerator can be rewritten making the binomial explicit as indicated 
in 


(b, Ex* — Zxyy — (Exyy + 2x? Ly? 
Sxe 
Now the value of the function, as it depends on the choice of a value for by., 
is determined by the expression! 
(b,, 2x? — £Xxyy, 
which, since it is a square, must be equal to or greater than zero. The function 
will be a minimum when 


(b,,Zx* — Sxy)? = 0 
in which case 
by, 2x? — Xxy =0 
and 
oxy 


Ex? (7.6) 


bys 
Thus it can be said that 
DOY — brx}? < Zy — ax)? 

when a # b,,. 

If we divide the numerator and the denominator of the fraction in formula 
(7.6) by N, we obtain 

ZOIN _ Vas, 
w SEIN Vo 


1The student may wish to review an earlier use of this method, in Chapter 4, to show 
that Z(X — M,)? < Z(X — a)? wherea + My. 


b 
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Formula (7.6) is a convenient computing formula for b,, if one first obtains 
2xy by formula (7.3), 


ZXLY 
Axy = ZXY — NC 
and £x? by formula (4.2), 
EY)? 
Ax = DXF (ay 


By similar procedures we could show that bzy, the constant for predicting 


x from y, is given by the formula, 


xy 
bey = Sy? (7.7) 
and that 
hey = 


Another Criterion for Prediction 


Consider again the prediction of y from x. We can write two covariances: 


One involving the actual deviations of x and y, 
Zxy 


Vay = 

and another involving the actual deviations of x and the predicted deviations, 
Exy 
Vey =N 

If we impose the condition that these two covariances are to be equal, we 


can solve for b,, to satisfy this criterion. We write 


Vay: = Vay 
from which it follows that 
ag Z9 
N N 
and 
Axy' = DX). 


Since y' = b,,x, we can substitute for y' and obtain 


ExbyX = ZXY. 
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The constant, 5,,, can now be written before the summation sign and both 
sides of the equation can be divided by £x? to yield 


Zxy 
Zx 
We observe that the same value for b,, satisfies both criteria. It makes the 


sum of squares of errors of prediction a minimum and the two covariances, 
Vz, and V,,,, equal. 


b, = 


Summary 


The prediction coefficient, 
Zxy 
~ Dix? 


is the constant in the prediction equation, 


Bis 


, 


Ü 


y! = byex. 
We may substitute given or actual values of x in the equation and compute 


predicted values of y', which have the property that the sum of squares of 
errors of prediction, 


y id Zo — yy, 
IS à minimum. 


The equation for predicting y from x can be written in terms of original 
values, X and Y. If we let y' = y’ — M ,and x = X — M,, then the equation, 
y' — byx, 

becomes, after substitution for y' and x, 


Y' — M, = b,(X — M,). 


Then 
Y= by. X = b, M, + M, 
and 
Y' = b,X + (M, — b, M). (7.8) 
Notice that formula (7.8) has the form of the general linear equation, 
Y = bX +a, 


Attempting the prediction of y from x by means of a linear equation is 

equivalent to attempting the expression of y as a linear function of x. The 
ji lg dar i à xo ; 

word " attempting " is used advisedly for the irregularities in the paired 


values often do not permit anything better than a crude approximation in the 
Prediction or in the expression of the relation. 


As regards b,,, the constant in the equation, 


x= bry, 
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Substituting actual values of y permits computation of predicted values of 
x’, which have the property that the sum of squares of errors in predicting x, 


X(x — xy, 
is a minimum. The equation can be written 
X' = by,Y + (Mz — bzyM y). (7.9) 


Formula (7.9) represents an approximation in expressing the relation between 
X and Y as a linear function. 


Graphical Representation of Prediction Constants 


After b,» has been computed for given paired values, the equation, 


vz 
y = byx, 

can be plotted by assigning two or more values to x, computing corresponding 
values of y’, and treating these pairs of values, x and y’, as the coordinates 
of points located in the scatter-plot of x and y. A straight line drawn through 
the points, (x, y), is called the best fitting line or the line of prediction. The 
Coordinates of any point on the line give a value of x and the corresponding 
Predicted value of y’. The line always passes through the point, (x = 0, 
y= 0. 

The prediction coefficient, bys, is computed below for the paired values in 


Table 7.3, 
Axy = 52, 
Zx = 62, 


and 


bye = 2 e 0.84. 


62 


Fig. 7.3 Plot of the best fitting straight line for the scatter-plot of Fig. 7.1. 
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We shall assign values of —4, 0, and +4 to x, and compute the corresponding 
values of y' by the formula: 
y = byex. 

The three resulting pairs of values, (x, y'), are (—4, —3.36), (0, 0), and 
(+4, +3.36). In Figure 7.3, these three points have been plotted in a repro- 
duction of the scatter-plot of Figure 7.1, and a straight line has been drawn 
through the three points. Notice that the points of the original scatter-plot 
do not fall on the line. If one draws a vertical line from any point to the line 
of prediction, that distance represents an error of prediction, 


(y —») = Y — bx). 


The constant, b,» = 0.84, is the slope of the line of prediction. It indicates 
the vertical change per unit horizontal change in the line of prediction. 
Notice that the slope has a positive sign. 

The prediction coefficient, b,,, for the values in Table 7.4 is 


Axy 0 
bis = Fs A 8 


Since b,„ = 0, all predicted values, y’, will be equal to zero and the line of 


prediction will be a horizontal line coinciding with the x-axis. It is interest- 
ing to note that, since b,, = 0, 


(y — bux) = 
asi vax) » 


Ay — b,,xy = Ly*, 


that is, the sum of squares of errors of prediction is equal to the sum of 
squares for the actual values of y. 


Another set of paired values is presented in Table 7.6 and plotted in 


Table 7.6 
PAIRED VALUES OF x AND y 


(Third Example) 


x y 

1 —1 2 

2 0 —2 

P 3 4 -3 
Pairs 4 2 2 
5 -3 3 

6 = 2 


Figure 7.4. The covariance is 
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Fig. 7.4 Scatter-plot and line of prediction for the paired values in Table 7.6. 


The prediction coefficient, b yz is 


Zxy | —3l 
LE ir aio 
The line of prediction has been drawn in Figure 7.4 through the coordinates: 
(—4, +3.64), (0, 0), and (+4, —3.64). 
. Notice that the points of the scatter- 
in the second and fourth quadrants, that the cov: 
the slope of the line of prediction is negative. 


plot of Figure 7.4 lie predominantly 
ariance is negative, and that 


Logical Properties of the Correlation Coefficient 


By definition, 
EZ, 
a= SC 
the product moment of Z, and Zy. If we substitute x/S, for Zs and y/S, for 
Z,, we obtain 


which can be written as 


and finally as 


fig (7.10) 


Formula (7.10) gives the first property of the product-moment coefficient 
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of correlation: The product-moment correlation coefficient is the ratio of the 
covariance to the square root of the product of the variances. Since the n^ 
root of the product of n numbers is the geometric mean of the n numbers, 
VV, is the geometric mean of the two variances. 
Formula (7.10) can be rewritten, as follows: 
| onm Zxy|N Zxy 


SVV NVV, JNV,NV, 
Since NV, = £x? and NV, = Ly’, 


Voy 


axy 
^ y 
Formula (7.11) is a convenient computing formula for r,,. The sum of 
products in the numerator, Xxy, can be obtained by formula (7.3). The two 
sums of squares in the denominator, Zx? and Æy?, can be computed by 
formula (4.2). 

Let us consider a second property of r,,. Imagine that we had two sets of 
paired values, X; and Y;, that we wished to predict variability in one set 
from variability in the other, but that we formulated the prediction problem 
in terms of Z, and Z, obtained by transformation from X; and Y;. If we let 


(7.11) 


Z, = an actual Z-score for variable Y;; 

Z, = an actual Z-score for variable Y;; 

Buz = the constant for predicting Z, from Z,; and 
Z, = a predicted Z-score for variable Y,; 


I 


then the prediction equation is 
i. Zy = By Z. 
An error of prediction is 
(Z, n Zi) = (Z, = By; Z.). 


The sum of squares of errors of prediction, the quantity which we wish to 
minimize, is 

NZ, — Zi) -xXZ,— BusZi 
If we apply the least squares criterion in solving for the value of f,,, just as 
we did in solving for b+, we find that 


2Z,Z. 
Bye = Y = Fry. 


Thus it turns out that the product-moment correlation coefficient is the 
prediction constant in the equation, 


Z, = Baz 
and also in the equation for predicting Z, from Z,, 


Z, = BeyZy. 
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A third property of rz, is given by the equality, 
Fey = d busbss, 
which can be expressed, as follows: The product-moment correlation 
coefficient is the geometric mean of the two prediction constants, bys and 


by. This relation can be proved by substituting 2xy/ Zx? for by, and Exy/ Ly? 
for b,, in the equality stated above. The substitutions yield 


r | Exy Zxy ACExy*: Xxy 
wy J ZeD JEE JZZ 


which is computing formula (7.11) derived earlier. 
A fourth property of rzy is indicated in the following equality statement, 


= lyy = l'as 
s: The coefficient, rey, is the correlation 
f either variable. Consider ry,. We 


ley 


which is to be interpreted, as follow: 
between actual and predicted values o 
can write 
sees 
wo JDP? 


Substituting b,,x for »' and simplifying yield 


" Ayby.x baly 00 TX) 0. we 
w = JESEAMSOA  JbIeip SZ 


A similar proof could be given showing that 
yy = lev: 
The fifth property of rzy has to do with two standard deviations: the 
standard deviation of actual values and the standard deviation of predicted 


values for either variable. In discussing the prediction constants, by, and bey, 
We defined y' and x’ as predicted deviations. Predicted deviations exhibit 


Variability which can be measured in terms of a variance and a standard 
deviation. The variance of predicted values of y is 


Xy" 
Weir in 

The standard deviation of values of y’ is, of course, 
Sy = J Vy. 


oefficient is the ratio of the standard 


The product-moment correlation c 
dard deviation of actual values of Y. 


deviation of predicted values to the stan 
That is, 
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which statement can be proved, as follows: 


s, dw, SIN _ SIF, 

Lem S, JV, J ÀyN Jay 

Since y' = b,,x, we can substitute and obtain 

JAEXbyx* — bux? — byy DX? l 
Vay 4 Xy Vay 

Substitution of Exy/ Lx? for bys gives 

Zxy Ex oxy 


Ext JE Exedy? 
The proof is concluded. A similar proof could be given that 


S. 


Fay = S 
The sixth property of r+, concerns the relation between Zy? (the actual 
variability in Y), Sy" (the predicted variability), and 2(y — y? (the 


variability of errors of prediction). The first step in determining the relation 
involves writing the identity: 


oy 


Voy 


yes y +  — FY 
Squaring both members of the identity gives 
Pay" + wy — y) tQ — yy. 
Summing N terms, on the left and on the right, and simplifying produce 
Zy = Zy" + Zyy(y — y) + By — yp. 


Examine the second sum on the right. We can remove the “2” 


, Substitute 
by;x for y', and obtain 


T 22b , x(y —byx) 
which is equal to 


2Àb,xy — 2Zb2,x® = 2b, Xxy — 2b? Xx, 


The quantity, Zxy/ Zx?, can be substituted for b,,, producing 


AZI)  A(Zxy) Zx? A Lxy)? 2(Zxyy 
Ax (Ex) (X9 E xu 


Thus the second term in the right member of the original identity can be 
eliminated, yielding the equality: 


Zy = By" xy — yy. 
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Dividing by N, we obtain 

V, =Vy t+ Vy-y- 
Thus it can be said that the variability in Y can be analyzed into two additive 
components: the variability which can be predicted from X and the variability 
which cannot be predicted from X. A similar statement could be made 


regarding the prediction of X from Y. 
In the discussion of the fifth property of rey, it was shown that 


— Sy _ Vr, 
Fay — S JV, 
Squaring reveals that 
Vy 
P =>" 
v 


If we take the equality, 
Vy = Ve + Vy: 


and divide both members by V, we obtain 


c= Vy + Vue 5 
Fy V, 

an equality statement which can be interpreted as saying that the whole s 
the variability in Y consists of two proportions: the proportion that can be 
Predicted and the proportion that cannot be predicted. Furthermore, since 
By = V/V, we can say that the square of the product-moment reponi 
Coefficient is the proportion of variability in one set of numbers that can be 
Predicted by a linear equation from variability in another set. (The quantity, 
r2,, is sometimes called the coefficient of determination.) 


Since i2, = V,[V,, we can write 


Vy-vy 
l= ri + 
and 
Ph 
l5. e 


We observe that (1 — r2) is the proportion of variability not predicted. (The 
value, (1 — r2,), is sometimes called the coefficient of nondetermination.) 
* ays E 
The seventh property has to do with the limits of ry. It was shown above 
that 


2 
1 — riy = V, 


from which it follows that 
Pey SA = fu): 
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Since variances are positive (or zero), V, y, and V, will be positive (or 
zero). Then (1 — 72,) must be positive (or zero). That is, 


(1 — 72, 2 0. 
Then 
leÓGm 
and 
Iz Jn. 


It follows that 

m) Srey S&S EM 
The product-moment coefficient of correlation is a dimensionless number 
having a value of —1, +1, or some value between these two limits. 


The eighth property of r,, is that the coefficient can be expressed as a 
function of the pair differences, Y — Y. Given two sets of numbers with a 
certain amount of variability in each set, then the smaller the variability of 
the pair differences, the greater r,, will be. The greater the variability of the 
pair differences, the smaller r,,, will be, algebraically. The proof follows. 

Consider the identities: 


A(x — y? = Bx? — 2xy + y?) 
and 
A(x — yy = Fx? — 2Zxy + y 
Transposing terms in the second identity gives 
22xy = Zx? + Ly? — Tx — yy. 
Dividing by two yields 


Zx + Sy? — Ux — » 


oxy 3 


Examine the quantity, Z(x — y)?, in the numerator. It is the sum of squares 
of differences between deviations. It can be shown to be equal to the sum of 
Squares of deviations of differences, Y — Y, from the mean difference, 
XX — YN. 

By definition, x = Y — M,andy = Y — M ,. Then 


£x — y = AC — M) — (Y — M,)P = ZX — Y) (M, — Mj. 


Let (Y — Y) = D, a difference. Then the mean difference, Ma, can be 
expressed, as follows: 


ID XAX-—Y) ZXX—xY XX yy 
mcm NON CN CUN TM. — My 


After substitution of D for (X — Y) and Ma for (M, — M 


u), We observe 
that 


A(x — yy = XD — M}? = Ze, 
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where d = D — Mi. Substituting Zd? for X(x — y) in 
Ax? + Ly? — Xx yr 


Axy = 2 
yields 
zs xy - xe 
Axy = x a 
We divide by N on both sides to obtain 
Ve t+ Vy — Va 
Vey = m RD 
Dividing on both sides by (Vz, yields 
B ONE Vat CE: P 
JV, 24V.V, E 


If we consider V, and V, fixed and Va allowed to vary, then it is evident 
that r,, becomes smaller in algebraic value as V, becomes larger, and that 
‘ey becomes larger in algebraic value as Va becomes smaller. 


Coding and Its Effect on 
Product-Moment Correlation 


Consider first adding (or subtracting) a constant. Let 
X'-X4a. 


We wish to examine the relation between rey and rsy: We can write 


Axy . 
When X' = X + a, My = Mz + a, and 
s (r4 d) — M dodi e JP Mem 


X 
I 
d: 
| 
z 
| 


Therefore, 
Axy 


Fay = Jxezy. i 
and we conclude that adding (or subtracting) a constant 
Product-moment correlation. 

Consider next coding by multiplying (or dividing) by a 
Xx’ =ax. 


ur 


does not affect the 


constant. Let 
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Again we wish to examine the relation between r,, and rwy. We begin by 
writing 
a 2xy 
A Ex" X, y? 
When X’ = aX, M, = aM,, and 


se X' — My = aX — aM, = a(X — M,) = ax. 


ley 


Then 

— xy a2xy Axy " 

Zeem  ajXGxy JID *” 
We'conclude that multiplying (or dividing) by a constant does not affect 

the product-moment correlation. 


Tuy 


Special Formulas for Product-Moment Correlation 


A special computing formula can be derived for the correlation between 


two variables, X and Y, when one of them is dichotomous. Consider the 
formula, 


oxy 
hy = 
2x2 Ly? 
where 
Ry = uy [EY 
N 
Bn» (E 
Z - DY E m 
and 
(ZY) 
Ey = pye E 
J Y N 


Let the variable X have only two distinct values, Y, and Y,. Let Y have r 


distinct values, Yi, Ys, Y3,..., Y;,. , and Y,. Then the bivariate frequency 
distribution will have the form given in Table 7.7. 


The derivation of a special formula is based on the fact that a dichotomous 
variable can always be coded to values of 0 and 1. Let X; be the coded value 
of X, and X;, the coded value of ¥,. The coding required involves subtraction 
of X, and division by (X, — X), as follows: 


NET? 

AU EL. des 
pon 
me cwm 
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After the two values of X are coded, the bivariate distribution can still be 
represented by Table 7.7 except that X, should be changed to X; = 0 and X, 
should be changed to X5 = 1. 


Table 7.7 


A BIVARIATE FREQUENCY DISTRIBUTION 
WHEN ONE VARIABLE IS DICHOTOMOUS 


fa f. Sums 
Note: fu + fa = N. 


er that the coding operations performed 


on the dichotomous variable have no effect on the correlation between the 
two variables. That is, rz, = "rw TO see the changes which can be effected 
in the formula for the product-moment correlation between X' and Y, 


Recall that it was established carli 


Zxy 
va = T > 
aty SEx” Sy? 
let us examine the quantities: ZX’, SX", and ZX'Y, in 
oap ee 
Ay = BEY ay 
and 
2 2 (XY 
boa P OS 


X" will be simply the frequency of X3, 


Since Y! = 0 and Vj = 1, the sum of 
he second column of Table 7.7. In 


Which is the sum of the frequencies in t 
Symbols, 


N x r r r r 
BX = E faXi t Z fiXa = Efa + fll) = Z Sie 
i + j j 
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Let us call the sum of the frequencies in the second column, f2. Then 


N r 
TA = Z fiz =f 
J 
The sum of squares of coded values, 2X”, will also be equal to the 


frequency of X;, which is the sum of the frequencies in the second column 
of Table 7.7. Thus 


N " r j2 r y r r r 
EX" = LNT + Ef = ZO + Zh)? Lf. = fa 
J J J J J 
£X" Y, the sum of products of X" and Y, is equal to the sum of the values 
of Y which are paired with X;. This quantity—let us call it Q,—is the sum 
of products of the frequencies in the second column and the Y values along 
the left margin of Table 7.7. That is, 
N r r 
Q, = ZX'Y = Z fnaXi Y; sp Z fi XiY;, 
J J 
but X, = 0 and X; = 1, therefore 
N r 
Or = ZPY- ZA. 
J 


We can now substitute f. for 2X’, f, for SX", and Q, for ZX'Y in the 
formulas for Zx'y and Zx”. After these substitutions, the formulas become 


and 


Then the computing formula for ry, when Y' is a dichotomous variable 
which has been coded to have values of 0 and l, can be written 


Qy — f3Z YIN 


rey m ——R 5 
Aa — FAN) y 
or as 
N = 
tery = Qs — faXY | (7.12) 


NNS — NFR) Xy? 
To compute r», by means of formula (7.12), the following values are needed: 
ZY = the sum of the N values of % 
Zy? = the sum of Squares of deviations for the N values of Y, 


Q, = the sum of the f., values of Y which are paired with Xj, and 
fa = the number of values of Xj. 


A product-moment coefficient of correlation computed by formula (7.12) is 
often called a point-biserial correlation coefficient. 
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mere = a special formula for the correlation between two variables, 
beu : n en both variables are dichotomous. Let the variable X have two 
Y, ani Y o pes A Let e variable Y also have two distinct values, 
kno. ey o ie requency distribution will have the form of 
m variables can be coded to values of zero and one as indicated below. 
y = (X, — XX — X,) =9; Xi = (%, — Xy)/(OG — Xj) — I. 
"n n Y)(Y2 — Y) = 93 Yi = (Y, — YK(Y — Y) = 1. 
oding does not affect correlation, rz; = zv. 
LR the coding of both variables, the bivariate frequency distribution can 
represented as distribution B in Table 7.8. 
a compute roy we need the quantities: 2X’, my" ZX'Y' JY and 
* - Since X! —0, X5 — 1 Yi — 0, and Y; = l, 
"d = fix ae faki = f0) + fol) = fa» the frequency of values 


Z^ = f? 4+ LAKE 0 09 + f? = So 
Y à = fo, and 
ZY’ = fa. 
Table 7.8 
A BIVARIATE FREQUENCY DISTRIBUTION 
FOR TWO DICHOTOMOUS VARIABLES 


B 


Yid fa p fs. 


faa 
N — fn fs + fu + fra 


of the coded variables. We observe that 
+ faX2Y, =0+0404-fex 
the frequency with which values 


y puer next the sum of products 
Theis, eUfuaxXlY. TfaXi Y; FfaiYi 
of y: m of products, XX'Y',is equal to fs», 

are paired with values of Ys. 
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Substituting f. for ZX’ and SX", f, for SY’ and zy", and Sos for 
2X'Y' in the formulas for sums of squares and products of deviations yields 


EX mf NO 
8 f$. 
Ay —f— N , 
and 
iy Saf. 
ZEx'y = fa — (NC 


When these values are substituted in formula (7.11), the formula for ry, 
becomes 


; fas — fa falN E 
oY as — FAINYs. — FEIN) 


0 Ms — faf 

AOI. — SIN.. — £2) 
By substituting ( fi, + fi, + for + foo) for N, (fis + fas) for f.o, and (fs + 
33) for fo, in the numerator of formula (7.13), (f... + f2) for N in the one 
factor and (fı, + fa.) for N in the other factor of the denominator, and by 


carrying out the multiplication and combining of terms, we can obtain a 
computing formula, 


or 


(7.13) 


Fay) 


oun fuf — fif : 

s Wah chi fe. 
Inspection of formula (7.14) will reveal that, when both variables are 
dichotomous, the computation of the product-moment correlation coefficient 
requires only the frequencies for the four possible combinations of values, 


11 f2» fia, and foy, and the frequencies for each variable: fa and fy for X; 
fi. and fy, for Y. 


(7.14) 


Other Transformations; Their Effect on 
Correlation 


A nonlinear transformation of X, such as 
Y= xX, 
may, on occasion, improve the prediction of a variable, Z. Given the paired 
values of X and Z shown in Table 7.9, the product-moment correlation 
coefficient, r,+, is 0.985. If Y is transformed to Y by the equation, Y — X?, 
then the paired values for Y and Z are those shown in Table 7.10. The cor- 
relation coefficient, r,., is 1.00. The explanation of the improvement in pre- 
diction is that the relation between Y and Z is not linear and the transforma- 


— 


Other Transformations; Their Effect on Correlation 153 


tion has the effect of producing a linear relation between Y and Z. That the 


Table 7.9 


PAIRED VALUES ILLUSTRATING 
A NONLINEAR RELATION 


X zZ 
1 20 20 
2 .30 .30 
3 .40 44 
4 .50 .62 
5 .60 84 
6 -70 1.10 
7 .80 1.40 
Table 7.10 


PAIRED VALUES FROM TABLE 7.9 
AFTER TRANSFORMATION 


Y-2x Zz 
1 04 20 
2 09 30 
3 16 44 
4 25 +62 
5 .36 84 
6 49 1.10 
7 64 1.40 


trend in the scatter-plot is rectified is apparent from an inspection of the plot 
of X and Z before transformation and the plot of Y and Z after transforma- 
tion. The plots are shown in Figure 7.5 and Figure 7.6. 


1.60 


20 — — — 


x 


Fig. 7.5 Plot of the paired values from Table 7.9. Notice the curvilinear trend. 


A nonlinear transformation, such as 
Y= /X, 
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will also sometimes rectify a nonlinear trend and improve the prediction 


1.60 


294—41————À — 1 
C4 - 64 


Fig. 7.6 Plot of the paired values from Table 7.10. Notice the trend is linear after trans- 
formation. 


involving a simple linear equation. Given the paired values of W and X 
shown in Table 7.11, for which the correlation is 0.987, the transformation, 


Table 7.11 


PAIRED VALUES BEFORE AND AFTER 
TRANSFORMATION OF ONE VARIABLE 


Ww X Y=VX 
1 2 4 2 
2 3 9 E 
3 4 16 4 
4 5 25 5 
5 6 36 6 
6 7 49 Ji 
8 E 
" 
. 
LJ 
w 
. 
1 — — —N 4 
9 x 49 


Fig.7.7 Plot of W and X for the values in Table 7.11. Note that the trend is curvilinear. 
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Y = A X, yields values of Y which are linearly related to W with a correlation 
of 1.00. Values of Y are also shown in Table 7.11. The plot of W and X 
before transformation is given in Figure 7.7; the plot of W and Y after 
transformation of Y is given in Figure 7.8. 


Fig. 7.8 Plot of Wand Y for the values in Table 7.11. Note that the trend, after trans- 


formation of X to Y, is linear. 


Substitution of Ranks 


ed for computing the product-moment 
bstituted for the original values of the two 
variables, X and Y. Given N pairs of values with N distinct values of X and N 
distinct values of Y, the first N ranks will be required in the substitution for 
each variable. It follows that the variance of the ranks will be the same for 
both variables and will be the variance of the first N integers. 

In Chapter 4 it was stated that the first N integers have a mean of (N + 1)/2 
and a variance of (N? — 1)/12. The formula for the product-moment coefficient 
computed from differences was earlier found to be 


„rtm Ks. 
w BAEN a 
If we let R,, be the product-moment correlation between two sets of ranks, 
Vg be the variance of the first N integers, and Vp be the variance of the 
difference between paired ranks, then 


R 2Vg — Vp i Vp 1 Vp 6Vp 
"CC" VR ^ ——2PVR XN? — 112 Ne —1' 


A special formula can be us 
coefficient after ranks have been su 


156 Product Moments 
If welet D — R, — R,, a difference between the two ranks in any pair, then 
the variance of the differences, Vp, will be defined as 
2D — Mp? 

N 
where Mp is the mean difference. Then 
ZR, - R) ZR, ZR, 

N N N 


since the means of the two sets of ranks are equal. Vp, which was defined 
above, now becomes 


Mp 


2D? 
UN . 
Substituting 2D*/N for Vp in the formula for R,, above yields the computing 
formula in final form: 
6Xp? 
NN? — 1) 
It should be apparent that computing R,, for two sets of paired ranks, 
when there are no ties in rank, is exactly equivalent to computing the product- 
moment correlation coefficient from the paired ranks. R», is no more than a 


convenient computing formula for product-moment correlation involving two 
sets of ranks with no ties. 


d. en (7.15) 


Restrictions on the Magnitude of ry, 


If the number of distinct values of X is not equal to the number of distinct 

values of Y, then 
—l € fey < +1 

and ray cannot equal either —1 or +1. That this restriction must obtain 
becomes apparent when one considers the predictions and their accuracy. If 
the number of distinct values of X is less than the number of distinct values 
of Y, then predictions from at least one value of X must be in error, for at 
least two different values of Y will be associated with that one value of Y. 

An example is given in Table 7.12. The number of distinct values of X is 
4 and the number of distinct values of Y is 8. Although a plot of the points 
would show a marked linear trend and although no other pairing of X and 
Y values could produce a larger covariance, the correlation is not perfect. It 
is, of course, very high, being 0.976. 


When both variables are dichotomous, ry can be computed from formula 
(7.14) which was derived earlier, 


fa " fuf — fafai à 
” VAASA 
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The magnitude of r,, is restricted when f2 # fs. in which case it will also be 
true that f, zz fi. since 


fa +f2=N =f. Tf. 


Table 7.12- 


PAIRED VALUES ILLUSTRATING 
A RESTRICTION ON rzy 


X Y 
1 1 1 
2 1 2 
3 2 3 
4 2 4 
5 3 5 
6 3 6 
7 4 7 
8 4 8 

rey = 0.976 


Examination of the 2 x 2 tables in Table 7.8 will reveal certain interesting 
relations among the frequencies. When f; < fz» then the largest value 
possible for fə» the value which will make rz, as large a positive value as 
possible, is f» in which case fis = 0 and fi, =fi1- Formula (7.14) can then be 
written 

- fife Afi fa (7.16) 
i Md fa fa fi. fs. A fa fs. 


Formula (7.16) gives the maximum positive value of rey when fa < fo. 


Exercise 7.1 


1. Compute the 10 possible covariances for the five variables given in Exercise 4.1. 


Example: 
ZV = 1,167 
ZW = 1,723 
ZVW = 100,712 
7 
3w = aym- 22” = 100718 (1,167),723) L 16534 155, 
N 24 
Po 
Vow = =r = 705.46 


2. Compute bzy, bzw» Dixy and bzy- Plot the equation for predicting Z from each 
of the other four variables. 
Example: 
Vow = 705.46 (from above) 
V, = 835.40 (from the example in Exercise 4.1) 


= 0.844 
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The prediction equation in terms of original values is 
W' = by,V + (Mw — bu,M;). 


The prediction equation written in terms of original values was derived earlier and 
presented as formula (7.8). To plot the equation, one selects three values for V 
and computes the corresponding values of W’. The resulting three pairs of values 
are then used as the coordinates of three points which can be plotted. A straight 
line can then be drawn through the three points. 

We shall use, as three values for V, 10, 50, and 90. M, is equal to 48.62; M, is 
equal to 71.79. The prediction constant, bw», is equal to 0.844. By substituting 
these values in the prediction equation given above, we obtain the three values of 
W': 39.19, 72.95, and 106.71. Thus the coordinates of the three points are (10, 
39.19), (50, 72.95), and (90, 106.71). The points have been plotted in Figure 7.9. 


134 


3c 


0 
v 100 


Fig. 7.9 Plot of the prediction equation in the example of Exercise 7.1. 


3. Compute the 10 possible intercorrelations for the five variables given in 


Exercise 4.1. 
Example: 
ZV = 1,167 
ZV? = 76,795 
ZW =1;723 


ZW? = 143,237 
ZVW = 100,712 


N = 
Formula (7.11) is - 
Zyw 
Tow =v a Swi 
(EV) (1,167)* 
Z = Ep? = ; 
N 76,795 24 = 20,049.62. 
(ZW (1,723)? 
Zw = ZW? — ss H : 
N 143,237 24 = 19,539.96. 


ZVZWw 
Zvw = EVW — N 7100712 CIS an — 16,931.12. 
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Then 
i 16,931.12 
is 4/(20,049.62)(19,539.96) 


0.855. 


A Look Ahead 


In the present chapter, product moments were introduced and certain 
special values, including the prediction constants and the coefficient of cor- 
relation, were discussed. The effects of coding on correlation were examined 
and some special formulas for product-moment coefficients were presented. 
Throughout the entire discussion of the present chapter, in every considera- 
tion of the predictive relation or of correlation, only two variables were 
involved. In Chapter 8, which is next, the methods of prediction and correla- 
tion are extended to more complex relations involving three or more variables. 
The three major topics of Chapter 8 are partial correlation, multiple predic- 
tion, and multiple correlation. 


PARTIAL AND MULTIPLE 
CORRELATION 


Syntactics | 8 


A partial correlation coefficient is the product-moment correlation of two 
sets of paired errors of prediction. For example, one might have the classifica- 
tion of values shown in Table 8.1 and carry out two sets of predictions: the 

Table 8.1 


A CLASSIFICATION OF VALUES 
IN N ROWS AND THREE COLUMNS 


Columns 

Ww X x 
1 W, X; Yi 
2 W: Xs Y. 
3 W, Xs Y; 

Rows , = 

i W, Be Yi 
N Wn XN Yn 


prediction of Y from W and the prediction of X from W. These two attempts 
at prediction would result in two sets of paired errors of prediction: the 
errors, (y — b,4w), in predicting Y from W, and the errors, (x — b,,w), in 
predicting X from W. There would be N errors of prediction in each set and 
these errors would be paired because two predictions would be made from 
each of the N values of W and consequently two errors would be associated 


with each value of W. The two sets of paired errors of prediction could be 
arranged as indicated in Table 8.2. 


We may now ask to what extent the: 
If we let ray., be the product- 
prediction, (x — b, 
can be written 


se two sets of errors are correlated. 
moment coefficient for the two sets of errors of 
ww) and (y — b,,w), the desired product-moment formula 


A(X — ba wy — byw) 
4X — Bou SQ — bus 


Fzy.w 
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The possibility of expanding the expressions in this formula, simplifying 
them, and then substituting for the constants should be apparent. The 


Table 8.2 
A CLASSIFICATION OF TWO SETS OF ERRORS OF 
PREDICTION 
Columns 
Errors in predicting X from W Errors in predicting Y from W 
1 Qa — bzuWi) Qi — byw) 
2 (Xa — brwW2) We — byuw2) 
3 (xs — brwWa) Qs — bywWs) 
Rows " i 
i (x; — bzw) Wi — by") 
N (xy — bewWy) On — By) 


derivation will not be given here but the convenient computing formula 
which results is 


r 
faye = TA 
ayw Vd 


Notice that formula (8.1) contains three values wh 
the original data: the three product-moment corri 
Y, and W. 

The partial correlation coefficient, zy.» 
between variability in XY not predictable 
predictable from W. The symbol, rsy.w, is often re 
X and Y independent of W.” 


r wyw 

e e (8.1) 
EX — rho) 

hich must be computed from 
elation coefficients for X, 


is the product-moment correlation 
from W and variability in Y not 
ad “the correlation between 


Multiple Prediction with Deviations 


The simple linear prediction equation, 
y’ = byx 
which was discussed in Chapter 7, involves only two variables: Y, the 
variable to be predicted and X, the variable from which predictions are to be 
made, The prediction constant, bys, is given a value, Zxy/ Zx?, which can be 
computed from the paired values of Y and Y, and which minimizes the 


quantity, 
Sy — yy = ZO — bux}, 


the sum of squares of errors of prediction. 
Linear prediction equations can be formulated for more than two variables. 
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Suppose we were given a classification of the variables, X, Y, and Z, similar 
to the classification of Table 8.1, and wished to predict Z from X and Y, 
simultaneously. For convenience, let us call Z the criterion variable, and X 
and Y the predictor variables. The prediction equation can be written 


z = b,x + by 
where 


N 
I 


a predicted deviation for the criterion variable Z, 

— an actual deviation for predictor variable Y, 

— an actual deviation for predictor variable Y, 
- the prediction constant for variable X, and 
y = the prediction constant for variable Y. 

Then an error of prediction can be expressed as 


Soy Xk 
ll 


(2 — z’) = (z — bx — by) 
and the sum of squares of errors of prediction can be expressed as 
A(z — zy -Z-—bx-—by,yy, 


which is the quantity we will want to minimize by a proper choice of values 
for b, and b,. The procedure involved in applying the least squares criterion 
here is analogous to, but more complicated than, the procedure in two 
earlier applications: the demonstration that the sum of squares of deviations 
from the mean is smaller than the sum of squares of deviations from any other 
value, and the demonstration that the sum of squares of errors in predicting 
Y from X, when b,, = Xxy/ Zx?, is smaller than the sum of squares of errors 
when 5,, takes any other value. 

Steps in applying the least squares criterion in the multiple prediction 
problem will not be given here in complete detail. In general terms, they 


involve showing that the magnitude of the sum of squares of errors of 
prediction, 
2(z — bax — b,yy, 


depends on b, as indicated by the expression, 


(b,2x* — Xxz + b, 2 xy), 


which must be equal to or greater than zero. 


I s The sum of squares of errors 
will have its smallest value when 


(b,ZÀx* — Zxz + b, Zxy} = 0, 
in which case 


b,2x® — Xxz + b,Xxy =0 
or 
b.Ex* + b,Xxy = Xxz. (8.2) 


Note that equation (8.2) contains the unknown constant 


b, in addition to bẹ 
and that consequently it cannot be solved. 


Ww —— 
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The sum of squares of errors of prediction can also be written in a form 
which reveals its dependence on b,, as indicated in the expression, 

(b,Ey? — Zyz + b,Exyy, 
which quantity must be equal to or greater than zero. The sum of squares of 
errors will, of course, be as small as possible when 
(b, Zy? — Zyz + b,Zxyy = 0, 
in which case 
by Sy? — Eyz + b,Exy =0 


or 
b,Xxy + b; Zy? = Zyz. (8.3) 


Note that equation (8.3) contains two unknowns, b, and b,, as does equation 


(8.2). 
Equations (8.2) and (8.3) are rewritten below as (a) and (b). 


(a) b, Zx? + by Sxy = Dyz 
(b — b.Zxy + b,Ey = Zyz 
Xxy, 2xz, and Ayz, are values which can be 
computed from the original classification of the three sets of numbers. To 


obtain values for the constants, bz and b,, this set of simultaneous equations 
in two unknowns must be solved. The student should examine a similar 


solution which was presented in Chapter 3; T . 
The equations can be solved by eliminating by and obtaining a single 


equation containing one unknown, b,. The steps in the elimination are, as 
follows: 

Multiply (a) by Zy? to obtain 

(c) bp Ext Dy + bxy Zy’ = xz dy. 

Multiply (b) by Exy to obtain 

(d) ba( Exp)? + by SxyZy* = xy Zyz. 
(c) to eliminate b, and produce 


The quantities, Zx*, Zy’, 


Subtract (d) from 
(e) ba Sx*Ly* — ba(Zxy)} = Zxzày | — ZExy Zyz. 


Factor the left member of (e). 

BISER — (Xxyyl- ZxzZy? — LxyXyz. 
Solve for b,. 
Ar DxzSy? — LxyLyz (8.4) 
b. = Deby? — (Exyy 


Formula (8.4) can be written 
zz^ V 


NE 
by Va 
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By means of a similar set of operations applied to equations (8.2) and (8.3), 
one can eliminate b, and show that 

AyzXx? — XxyZXxz 

A c Lx Ly® — (Xxyy 


(8.5) 
or 
Vy — VayVne 

VV, — Và, 


Formulas (8.4) and (8.5) can be used for the computation of b, and b,, the 
two prediction constants in the linear prediction equation, 


b, = 


z’ = b,x + byy. 


Multiple Prediction with Standard Scores 


Consider the prediction of Z from X and Y by means of the linear equation, 
Zi = BaZa + B, 


Notice that the three variables have been transformed to standard scores and 
that the two prediction constants are now denoted by the Greek letter f 
(“ beta ”) with appropriate subscript. 

An error of prediction is written 


(Z: — Z:) = (Z: — B.Z, — B,Z,). 
The sum of squares of errors of prediction is 
AZ, i Z)? = 2(Z. D B.Z. = BZ}, 


which is the quantity to be minimized. Applying the least squares criterion 
for obtaining values of B, and B, yields the set of simultaneous equations: 


BZZ} + B,ZZ,Z,— ZXZZ, 
B.ZZ.Z,-- B, ZZ? d M 
Dividing each equation by N yields 
B.ZZ3IN + B,2Z,Z,/N = ZZ,ZJN, 
B,£Z,Z,/N + B,EZN = XZ,ZJN. 


Since ZZ?/N = XZ*IN = |, 2Z,Z,[N = ry 2Z,Z,[N = ry, and 2Z,Z,/N 
= fya Substitutions can be made and the equations can be written 


B. + Byrey rus 
Bits + B, = fy; 


Il 
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The equations contain two unknowns: the prediction constants, B. and B,. 
The known values in the equations are the product-moment coefficients: rey, 
rs, and ry4 which, in any specific problem, would either be given or be 
computed from the original classification of the three sets of numbers. 

One might solve for B, and B, by the same methods employed in the 
preceding section in solving for b+ and b,. The procedure for eliminating one 
unknown, f,, and the resulting formula for the other unknown, f,, are 
presented below. 

Given the equations, 


(à — Be Birey = rs; and 
(b — Batay + By = rus 
the steps in eliminating B, and solving for f, are, as follows: 
Multiply (b) by rz, to obtain 
() Barby + Baray = Pav! ue 
Subtract (c) from (a) and obtain 
(d | By — Barby = ras — Veuve 
Solve for B, by factoring on the left and dividing by the coefficient of p.. 


BA = rey) = las — lauu 


M id 8.6 
Bec iy (8.6) 


By similar operations we could show that 


a a 8.7) 
— Try 

Where only two predictor variables are involved and, consequently, only 
two prediction constants must be determined, then the elimination of one of 
the unknowns by appropriate arithmetic operations which produce a single 
equation in one unknown is not at all difficult. When the prediction equation 
involves three or more predictor variables and a corresponding number of 
prediction constants, these fundamental arithmetic procedures, being quite 
lengthy and tedious, are not usually employed and formulas for the constants 
are not generally available. When there are three or more unknowns, two 
other computational approaches are widely used. o , 

In the older approach, which is appropriate for computation involving 
desk calculators, the detailed computing instructions devised in 1878 by 
Doolittle, an engineer with the United States Coast and Geodetic Survey, Or 
modifications of his instructions, are employed. These detailed instructions 
cleverly take advantage of certain standard features of the simultaneous 
equations in multiple prediction problems. Repetition in notation Is reduced 
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to a minimum and the entire computation is condensed and given a compact 
tabular representation. Even with the routinization and condensations of 
these procedures, however, the solution for three or more unknowns is still 
burdensome. For this reason, the Doolittle instructions will not be repro- 
duced here. The student who may require these detailed computing instruc- 
tions for a multiple prediction problem involving three or more unknowns 
can find them in a number of texts including those by Guilford (1, 406-409), 
Johnson (2, 330-337), and Kenney and Keeping (3, 298-301). 

In the newer approach, one formulates the fundamental solution in terms 
of matrix algebra and has the burdensome computation done on an electronic 
computer. To demonstrate the nature of the computer solution, we shall 
employ the simple problem involving two predictors and two unknown pre- 
diction constants. The student should understand that one would not put 
such a simple problem on a computer. The simple problem is used here 
simply for convenience in introducing the student to matrix notation. 


Matrix Notation 


For purposes of discussing matrix notation, we shall choose the problem 
of predicting variable Z from variables X and Y, when all three have been 
transformed to standard scores, in which case the simultaneous equations 
involve product-moment correlations and the unknowns, B. and f,. The 
application of matrix notation to the equations involving sums of Squares 
and products, and the unknowns, b, and b,, follows the same principles. 


The simultaneous equations for the problem in which Z, is to be predicted 
from Z, and Z, have been shown to be 


B. a Puts = fen and 

Bess + By = 
It is interesting to note that these vari 
displayed in tables or matrices. The coefficients of Pz and B, can be displayed 
in a matrix of two rows and two columns, a 2 x 2 matrix. Notice that the 
unit values in the diagonal cells are actually coefficients of 8, and B,. The 
prediction constants themselves can be placed in a 2 x 1 matrix and the 


correlations with the criterion variable Z can be arranged in a third matrix 
of two rows and one column. The three matrices are shown below. 


ious quantities can be separated and 


B. r 
B, r 
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Assuming that you were given the task of computing the correlations, Fez 
and r,, in the 2 x 1 matrix on the right, from the information in the other 
two matrices, what system would you use? One system, which has now 
become a standard procedure, is to take the entries from the tables and com- 
pute with them according to the instructions contained in the equations 
themselves. In the first equation, 


Be + Byrsy = Tan 


one is instructed to obtain the product, B,(1), and the product, Burzy, and 
to sum these products, obtaining rz:- In the second equation, 


Bzrsy + By = rus 


one is instructed to obtain the product, 8,r2y, and the product, B 
sum these products, obtaining ry:- 
Examine closely how the operation o 
arrangement of entries in the tables. It amounts to pairing entries in a row 
of the matrix at the left with entries in the column of the middle matrix, 
obtaining the product for each pair, and summing these products to obtain 
an entry in the matrix at the right. This operation of summing products of 
row and column entries in two tables is called matrix multiplication. The 
Operation is expressible in a very compact notation when there is no need to 
show the tables themselves and the values they contain. For example, if we 
call the matrix on the left, P, because it contains the correlation between the 
two predictor variables, and call the matrix in the middle, B, for the B-values 
in it, and call the matrix on the right, C, because it contains the correlations 
with the criterion variable Z, then the entire matrix multiplication can be 


represented as 


(1), and to 


f summing products relates to the 


PB=C. 


The student may find himself at a loss, initially, to react to such abbreviated 
Notation. It may help to suggest that, in looking at PB = C, he should say 
to himself, “ P is a table of numbers; B is another table of numbers. I am 
instructed to pair the entries in each row of P with the entries in each column 
of B, to obtain the product for each pair of values, and to sum these products. 
Carrying out this operation for a given row of P anda given column of B 
yields a sum of products which is one entry ina cell of table C 

All that we have done, up to this point, is explore, in a limited fashion, the 
matrix representation and notation for the simultaneous equations in the 
multiple prediction problem. We have shown that, given P and B, we could 
Obtain C, but finding C is not the problem we ordinarily have to solve. The 
problem we actually have can be stated, as follows: Given the correlations, 
Tey Fan and ry, compute Bz and p,. That is, given P and C, find B. 

Let us examine the two formulas for B, and B,, (8.6) and (8.7), and we shall 
see that, when these two equations are written in matrix form, the computation 
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does then correspond to the problem we are interested in solving. Formulas 
(8.6) and (8.7) are given below, with the direction of the equality statement 
reversed. 


Vos — Vayl ye _ B 

l —rv2, * 
elas H Fus B 
1— ri, P 


The two equations can be rewritten with two terms in the left member of 
each, both terms having a common denominator, as follows: 


The equations can be rewritten again to emphasize the entries for matrix C, 
rz: and ryz, and the coefficients of these values, as follows: 


1 Png 
[a] [- rg - ne 


zy. 


_ Tey 1 
[- rige tg] = 


These various quantities can be tabulated in the three matrices below. 


Faz Pa 
Fyz B, 


Given values for the entries in the matrix at the left and values for r,, and 
Tye the entries in the middle matrix, we could, by the process of matrix 
multiplication, compute B, and B,, entries in the matrix at the right. The 
student should be able to identify the middle matrix as C and the matrix on 
the right as B. The matrix on the left is obviously not P, as originally written, 
but does contain entries which can be computed from entries in P. The matrix 
on the left is called the inverse of P and is conventionally denoted by the 


symbol, P~, Computing entries for P-! from entries in P is called matrix 
inversion. 


‘Unfortunately the student will probably not be able to generalize the computation of an 


i Simple example given here to larger problems. A discussion making 
this generalization possible is beyond the scope of the present text. The interested student 


will find an excellent general treatment of matrix inversion in F. E. Hohn, Elementary 
Matrix Algebra (New York: Macmillan, 1958). 
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The problem of solving for values of B. and B, can now be expressed in 


the matrix notation: 
PAC = B. 


In reacting to this symbolism, it is suggested that the student say to himself, 
** P-1 (P inverse) is a table of values computed from P, another table con- 
taining the correlation between the two predictor variables. C is a table con- 
taining the correlations between the predictor variables and the criterion 
variable. I am instructed to pair the entries in a row of P^? with the entries 
in a column of C, to multiply the paired entries, and to sum these products 
obtaining a prediction constant to be entered in table B." 

All that has been said above about the simple prediction problem involving 
a total of three variables holds for larger problems. Given a specific problem, 
the student would construct two matrices: P, a table of intercorrelations for 
the predictor variables, with l's as the diagonal entries, and C, a table of 
correlations between the predictor variables and the criterion. He would take 
matrix P to a computation center and ask to have the matrix inverted.? The 
computer service would return to him the inverse matrix, P-}, The student 
could then perform the matrix multiplication, 

PAC = B, 


to obtain the f’s.? 


The Prediction Equations 


and wishes to employ prediction equations 


If one computes B, and By, 
erp » btainable by the formulas: 


involving 5, and b,, the latter values are o 
S: 
Ba es S. (8.8) 


and 


by = Pg 8.9) 


The prediction equation for deviations, 
z = bx + by)» 


can be written in terms of original values, X, Y, and Z, as follows: 


(Z — M,) = bX — Ms) + b (Y — M) 


*Programs for matrix inversion are found in all computer libraries. 
It is possible, of course, to have the entire computation done by the computer M 
Entries in the original matrix of intercorrelations for all variables, a combination of P an 


C, could be obtained on the computer, and the final multiplication, P-'C, could also be 
done on the computer. The procedure was presented above, broken down into its major 
Phases, so that the student's understanding and appreciation of it might be increased. 
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or 


Z' = bX +b, Y — bM, — b,M, + M, 


Multiple Correlation 


The coefficient of multiple correlation, R», is defined as rz,, the product- 


moment correlation between actual values of Z and predicted values of Z. 
It is computed by the formula, 


Rey = s Batas F Burg: (8.10) 


Rizy is the proportion of variability in Z predicted, simultaneously, from 
variability in X and Y. The quantity, (1 — A2,,), is the proportion of 
variability in Z not predicted from variability in X and Y. 

Exercise 8.1 


1. Compute Fzy,w the correlation between X and Y independent of W for each 


of the correlation matrices given in Table 8.3. Use formula (8.1). Study the pattern 
of relations in each problem. 


Example: 
Given ry, = —0.50, rey = 0.50, and ry, = 0.40, then 
Lo Pu Pastas 40 — (—.50)(.50) 
UC X4X0-750—75) JG = 33 — 35) 


2. Compute the multiple correlation coefficient, R; ,,, for the data in Exercise 4.1. 


Use formulas (8.6) and (8.7) to compute the beta-coefficients, B, and B,, and 
formula (8.10) to compute R 


0.867. 


zey 


Table 8.3 
FOUR 3 x 3 CORRELATION MATRICES 


Ww X Y 


Multiple Correlation m 
hien the coefficients, b, and b,, by means of formulas (8.8) and (8.9), and write 

prediction equation involving the original values of X, Y, and Z, with computed 
values of b, and b, substituted in the equation. 


Example: 
Given rz, = 0.30, rz, = 0.50, and r. 0.6 

30; ra; = 0:50, y; = 0.60; M, = 50, M, = 60, and 

M; = 100; S, = 10, S, = 10, and S, = 20, then $ d 

.50 — (.30)(.60) 


Ves — aul yz 
Bs = T Hs T= 30: 0.352, 
Fyz — Vale .60 — (.30)(.50) 
b= “Ton, T= 30: 0.494, 
and 
Rasu = Brus + Buruz = 4(352)650) + (-494)(60) = 0.687. 


The coefficients, b, and b,, are computed, as follows: 


S; 2 
bz = Brg = 352 a = 0.704, 


and 
Sz 20 
b, = By = 49455 = 0.988. 
y = By Sy 10 
Table 8.4 


MATRICES P, P^, AND C FOR VARIABLES IN TABLE 4.4 


p 


V Ww X Y 


— 0.186350 


V 1.000000 | 0.855403 | —0.232095 
W | 0.855403 | 1.000000 | —0.170851 | —0.084012 

X | —0.232005 | —0.170851 | 1.000000 | — 0.833802 0.637972 
Y | —0.186350 | —0.084012 | 0.833802 | 1.000000 0.537620 


3.924057 | —3.330181 — 0.113690 0.546268 


— 3.330181 3.868282 0.441149 | —0.663428 


—0.113690 0.441149 3.398407 | —2.817722 


0.546268 | —0.663428 | —2.817722 3.395484 


172 Partial and Multiple Correlation 

The prediction equation can then be written 

Z' = b,X + b,Y — bM, — b,M, + M 
or 
Z’ = 0.704 X + 0.988Y — (0.704)(50) — (0.988)(60) + 100 
or 
Z’ = 0.704 X + 0.988 Y + 5.52. 

3. Table 8.4 gives the P matrix for variables V, W, X, and Y in Table 4.4 of 
Exercise 4.1, and P~, the inverse of P. Both matrices were computed on the IBM 650 
in the Computation Center of Pennsylvania State University. (Actual computation 
time for the inverse was less than one minute.) Matrix C, which contains the cor- 
relations with the criterion variable, Z, is also given in Table 8.4. 


Perform the matrix multiplication, 
PAC = B, 


to obtain the matrix of coefficients: By, By, Bz, and By. Also compute Ry, Ov» 
bw, bz, and b,. Write the prediction equation using V, W, X, and Y, and the 


computed coefficients. The formula for Fowey 1S 


Revwry = A Bot: + Baluz + Balas + Porya 
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The following discussion of product moments, and the various d-statistics 
computed from product moments, will be based on their logical properties, 
as these were presented in Chapters 7 and 8, and will not entail consideration 
of the error to which these values might be subject in any formal statistical 
sense. As stated earlier, it is our purpose to make and maintain thedistinction 
between description and inference, as clearly as it is possible to do so. The 
descriptive role of product moments and values computed from them will be 
outlined in the present chapter. The error properties of certain i-statistics 
computed from product moments will be presented in a later chapter. 

Although there is no operation of more fundamental importance in research 
analysis than the specification of a relation between two variables by means 
of an equation, the wide applicability and general desirability of this operation 
are seldom understood and appreciated by the beginning student in his typical 
approach to a research problem. Too often he fails to see the potential and 
the relevance of a function and will, as a consequence, try to assess his results 
by some crude and inferior method of his own devising. The beginner's own 
inventions are usually inadequate not only because they fail to capitalize on 
functional relations, but also because they often result in the discarding of 
valuable information. Two common practices which betray a lack of apprecia- 
tion of the role of functions and an inadequate conceptualization of research 
hypotheses are dichotomization of variables and selection of extreme values 
from distributions. One of the student's first goals in preparing to become a 
researcher should be the development of an understanding and appreciation 
of the great variety of situations in which obtaining a function to represent a 
relation is not only possible, but also highly desirable, as a way of handling 
data. The student researcher's first though in planning and evaluating research 
should be concerned with the possibility of a functional analysis. 

It should be clearly understood, at the outset, that a function can be 
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established for the relation between two variables only when the observations 
on the two variables are paired. How observations become paired seems 
obvious to the experienced researcher, but is often a mystery to the beginning 
student. Competence in research depends to a considerable extent on the 
capacity of the researcher to detect or discern possibilities for obtaining paired 
Observations. Of course, this pairing of observations is never the result of 
arbitrary decisions and arrangements on the part of the investigator. The 
pairing is dictated by the observational procedures of the research. To be 
more specific, the investigator obtains a pair of observations, one on each 
variable, for a single object or a single event. Thus data consisting of N pairs 
of observations imply the measurement of N Objects or events on each of two 
variables. Consequently, this pairing, being the circumstance which makes the 
expression of a relation between variables meaningful, must be held inviolate. 

To convey to the student some idea of the wide range of situations which 
yield paired observations and therefore offer possibilities for functional inter- 
pretation and representation, we shall list below six broad classes or types of 
variables which are of continuing interest to the research psychologist. Under 
each class a few specific examples of variables are named but these are not 
described in detail with respect to the particular operations which are neces- 
sary and appropriate for obtaining the observations. For operational details, 
the student should consult texts or articles specializing in the particular area 
in question. The listing is not exhaustive and the types are not to be taken as 
absolutes in any sense. Following the list of classes with examples will be a 
number of combinations of variables which could be the basis for paired 
Observations and which could conceivably give rise to a meaningful relation 
expressible by means of an equation. These are not meant to be suggestions 
for original research. They are simply intended to enlarge the student's 
concept of functional interpretation and representation. 

Classes of variables of interest to the psychologist: 


l. Data on the physical or social environment of a subject, including 
atmospheric temperature, barometric pressure, humidity, rainfall, altitude, 
latitude, longitude, population of community, size of School, size of family, 
family income, and per capita expenditure for education in the state. 

2. Biographical data, including age, grade in School, amount of education, 
salary, number of years married, and number of children. 


; 3. Body measurements, including height, weight, size of waist, and size of 
oot. 


4. Food, nutrient, and drug consum 
of carbohydrate, amount of protein, 
amount of iron, drug dosage, number 
of water. 


ption, including caloric intake, amount 
amount of fat, amount of vitamin A, 
of cups of coffee, and number of glasses 


5. Series, including successive time 


; periods, trials, and ordinal positions of 
successive events. 
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6. Behavioral measures, including 

a. physical measures of responses or products, such as length of standing 
broad jump, distance of discus throw, weight lifted, time required to run 
100 yards, latency of response, and area covered by a drawing; 
: b. physical measures of a stimulus under standard conditions for count- 
ing discrimination responses, including intensity of a tone, voltage of a light 
Source, concentration of salt in solution, frequency of a tone, and wave 
length of a light source; 

c. frequency counts of responses, including number of correct items, 
number of self-references, number of favorable statements endorsed, and 


number of “ liked " activities; 
d. judgments of response attributes, including rank given by an expert 


judge and average rating given by three observers; 
e. measures of agreement between two sets of rankings of a number of 


items by a single individual. 

To facilitate the outlining of possibilities for functional representation, we 
Shall consistently refer to the two variables in each example as X and Y, and 
shall use NV as the number of pairs of observations. In every case, the 2N 
Observations could be displayed in a measurement matrix of N rows and two 
columns as shown in Table 9.1 and could be represented graphically by means 


Table 9.1 


GENERAL FORM OF A MEASUREMENT 
MATRIX FOR TWO VARIABLES 


Variables 


Xx ¥ 


2 

3 
Pairs of 
Observations 

i 

N 


Of a scatter-plot as indicated, in general form, by Figure 9.1. The coordinates 


of any point in the scatter-plot are the pair of values: X; and Y;. Graphically, 
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the function relating X and Y will be a straight line or a curve of best fit for 
the points in Figure 9.1. 


Plot of points: 


Zo[QY) Ge OY) Co Kye Y) 


x 


Fig. 9.1 General form of the scatter-plot for the paired measures in Table 9.1. N = 
number of pairs of measures — number of points in the plot. 


Typical combinations of variables whose relation might be evaluated by a 
function given by a mathematical equation are listed below. 

1. X = score on a science interest inventory; Y = score on a science 
achievement test; N — number of subjects taking both tests. 

2. X — number of years of formal education; Y — score on an attitude 
scale; N — number of subjects with attitude scale scores. 

3. X — hours of food deprivation; Y — amount of activity measured in 
terms of the number of revolutions of an activity wheel; M = total number 
of successive hourly periods of food deprivation for a single animal. 

4. X — the ordinal number of a trial; Y — latency of a running response; 
N = total number of trials allowed an animal. 

5. X — the rank assigned to an item by a subject in describing his father; 
Y = the rank assigned to an item by a subject in describing himself; N = 
total number of items. 

6. X — physical measure of the intensity of a stimulus; Y — percentage 
of correct discriminations; N = number of different stimulus intensities used 
with a single subject. 

7. X — height of athlete; Y — distance of running broad jump; N — 
number of athletes. 

8. X — age of a salesman; Y — amount of commissions for a specified 
period of time; N — number of salesmen. 

9. X — a student's reported average amount of time per week spent in 
study; Y — a student's final average college grade; N — number of students. 

10. X — a teacher's rank in teaching ability as judged by an expert; Y — a 
teacher's salary; M = number of teachers. 

11. X = number of cups of coffee drunk daily; Y — amount of carbo- 


hydrate in daily diet; N = total number of days over which a single subject 
was observed. 
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Choice of an Equation 


In a strictly theoretical sense, the research psychologist has an unlimited 
number of different equations, from which he might choose one for the 
interpretation and representation of any given set of data. In practice, his 
choice most often turns out to be the simple, linear equation: 


y = by, 


with its associated prediction constant, b,., and product-moment correlation 
coefficient, r,,. The prediction constant, byz, indicates the number of units 
change in Y per unit change in X for the predicted values of Y, that is, change 
as a point moves along the best-fitting straight line. Since the units of many 
psychological variables are not invariant from one test or instrument to 
another, psychologists tend to report the results of their research more often 
in terms of the dimensionless correlation coefficients than in terms of predic- 
tion constants. Correlation coefficients are also employed when there is no 
desire on the part of the investigator to focus upon the direction of the 
predictive relation, as one does in using byz OF bey : 

Three main factors appear to account for the psychologist's frequent 
Choice of the simple, linear prediction equation. The first factor is the con- 
venience which arises mainly from the ease and simplicity of computation, 
graphical representation, and communication. The second factor is the 
Beneral background of experience in psychological research, known and 
Shared by most active researchers, which background suggests that linear 
relations are to be found much more frequently than are nonlinear ones. The 
third factor is that linear functions are often useful approximations to a rela- 
tion, even when the true function is curvilinear. It would be incorrect, of 
Course, to convey the impression here that all relations between variables, in 
Psychological research, are necessarily linear or even approximately so. The 
researcher must be ever on the alert for evidence of nonlinearity." The student 
should also be reminded that, because all relations are not necessarily linear, 
it does not follow that they are necessarily curvilinear. The nature of the 
relation between two variables is, in the final analysis, a matter of empirical 


determination. 


Curvilinear Prediction and Correlation 


There are countless nonlinear equations which might be employed for the 


joe simplest check for nonlinearity is an inspection of the trend revealed in the scatter- 
Plot. 
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relation between two variables when a linear equation is not a sufficiently 
good approximation, but multiple-linear prediction and correlation, dis- 
cussed in Chapter 8, provide the basis for a very practical solution to the 
problem. 

To structure the problem and outline its solution, let us say that the 
criterion variable, Z, is to be predicted from predictor variable, X, but there 
is evidence of a moderate departure from linearity. To allow for this departure, 


we may manufacture a second predictor variable, Y, simply by transforming 
X, as indicated by the formula, 


yq 
We now propose to predict Z frem X and Y, simultaneously, in the equation, 
z' = b,x + byy, 


where x = (X — M,) = X — ZX|N and y — (X? — Me) = X? — ZX*JN. 
Predicting from X and X? amounts to solving for constants in a quadratic 
equation and to the fitting of a parabola to the scatter-plot for Z and X. The 
N X 3 measurement matrix is shown in Table 9.2. Keep in mind that 


Table 9.2 


MEASUREMENT MATRIX FOR 
CURVILINEAR PREDICTION 


2 
as 
N 


Subjects or . . " 
Occasions i Xi Y; Zi 


N Xy Yy Zy 
measures of Z and X are directly the result of observation, but that values in 
the Y column are simply the computed squares of X. The form of the scatter- 


plot for Z and X is suggested in Figure 9.2. One could imagine a plot of points 
displaying a trend for which a curve, and not a straight line, gave the best 


fit. In the case of a simple, linear prediction equation, the coordinates of 
points lying on the line of prediction would be (Z', X) where 


Z' = b,X — bM, + M; 


In the case of curvilinear prediction suggested above, the coordinates of 
points lying on the curve would be (Z', X), where 


Z' = b,X + byY — bM, — byM, + M.. 
Do not forget that Y = X? and M, = XX: 2|N. 


— 


meme —- 
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To solve for b, and b,, we might first solve for. B and f, following the 
procedures outlined in Chapter 8. The computation would involve obtaining 


Plot cf points: 


Zo z) Ge) (XZ) CM 0 Z) 


x 


would consist of points whose coordinates were 
(Xn Z), (s), (Xp Zd Os dr (Xr Zn), pairs of values from Table 9.2. The coordinates 
of any point lying on the best-fitting curve would be (X; Zj) where 

Z'— b,X + b,Y — b,N, — bM, + M. 


Fig. 9.2 The scatter-plot for Z and X 


and 

fex, 
en be substituted in formulas (8.6) 
8) and (8.9) could then be used 
o compute M,, My, and M; 


Fey, azy Pyas Sy Sy, and Sa: Values could th 
and (8.7) to compute B, and B,. Formulas (8. 
to obtain b, and b,. It would also be necessary t 
for the prediction equation. 


Partial Correlation 


Given two variables, X and Y, having a substantial correlation, it is entirely 
Possible that the observed relation between the two can be accounted for, 
wholly or in part, in terms of a third variable, W. That is to say, if we remove 
from X and Y the variability which can, in each case, be predicted from W, 
the correlation remaining between X and Y may be reduced considerably in 
magnitude, The correlation between X and Y after variability predicted from 
W has been removed is the partial correlation coefficient, rey.» We have 
Suggested that rey.w might be, but is not necessarily, less than rsy. An example 
is given below. 

An instructor of an introductory psychology class found the correlation 
between two successive course tests to be 0.70. He suspected that this con- 
Sistency of performance in the course could be accounted for by differences 
in the ability of students as it could be measured by a precourse test of verbal 
ability. He decided to compute the partial correlation coefficient for the 
Course tests independent of the verbal ability test as a check on his explana- 
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tion of the consistency in performance. The correlation between the verbal 
ability test and the first course test was 0.30. The correlation between verbal 
ability and the second course test was 0.40. He used formula (8.1), 

loy.w = I E - LL , 

A — r£ — r2,) 

where variable W was scores on the verbal test, X was scores on the first 
course test, and Y scores on the second course test. He obtained a value of 
0.66 for rzy.~. The result supported his idea that the relation between the two 
course tests was, in part, accounted for by their relations with the verbal test, 
but indicated that a fairly high degree of consistency in performance existed, 
independent of verbal ability. It is possible that the test of verbal ability was 
a poor predictor of course performance because it was a poor measure of 
verbal ability, but it is also possible that other factors such as differences in 


motivation and interest accounted for the consistent differences among 
students. 


Practical Prediction for Selection 


Prediction equations are used widely in industry, education, government, 
and the military for the selection of personnel. The general procedure in a 
selection program involves the following steps: (1) choosing a criterion 
variable and one or more predictor variables; (2) measuring applicants or 
candidates on the predictor variable(s); (3) evaluating. their performances on 
the criterion; (4) doing the computation required in establishing the predic- 
tion equation; (5) testing a new group of applicants or candidates on the 
predictors only; (6) applying the equation to the new group to predict its 
performances on the criterion; and (7) selecting applicants or candidates from 
the new group on the basis of their predicted criterion scores. 


The criterion variable is usually a measure of achievement, proficiency, or 
productivity, The measure may bea grade, a test score, an index of some sort, 
or a rating by a supervisor. A predictor variable is usually a test score or à 
personal history item. Sometimes a single predictor variable is satisfactory: 
but often two or more predictors prove to be better than one. Seldom is there 
an advantage in having more than five predictors. The criterion measure may 
be obtained at the same time as the predictor variables, but it is often 
obtained later. i 

There is a major problem in this selection procedure. Notice that the 
prediction equation is established on one group and applied to another for 
prediction purposes. Data for the original group consist of measures on the 
predictor variables and the criterion variable. From these data, the constants 
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for the prediction equation can be computed. Data for the new group consist 
only of measures on the predictors. The criterion performance of an indi- 
vidual in the new group is predicted by substituting his predictor score(s) in 
the equation. One of the uncertainties as to the success of practical prediction 
relates to the question of whether or not the equation applies to the new 
group. Judging from experience accumulated by psychologists, there is little 
doubt that it is sometimes applicable or appropriate, being more so on some 
Occasions than on others. The appropriateness of the equation for the new 
group depends on the response similarity of that group to the original one 
on which the equation was developed. 

Another problem having to do with the appropriateness of the equation 
arises from the selection which may have taken place in the original group. 
Very seldom does an industry, an educational institution, or a government 
agency employ or accept any and all applicants. In practice there is almost 
always some screening and some rejecting. It is evident then that the predic- 
tion equation is established on a group which has gone through selection. 
The new group, to which the equation is applied, however, is usually an un- 
selected group. Ideally, the equation would be established on an unselected 
group. The effect of this departure from the ideal can be expected to be a 


reduction in the accuracy of prediction. 


Test Construction 


Product-moment correlation has important applications in the work of 


Constructing and revising psychological tests, scales, inventories, and question- 
haires, The reliability and the validity of the psychologist’s measures are often 
expressed in terms of product-moment coefficients. Item analysis and item 
Selection procedures also involve extensive app 
Correlation. 
Determining reliability is the psychologist’s way of assessing reproducibility. 
There are three methods in common usage. The psychologist may administer 
a test on two different occasions to the same group of subjects. The correla- 
tion between the first administration and the second is a measure of test- 
retest reliability. Having administered a test on one occasion to a group of 
Subjects, he may divide the test into two halves. A score is determined for 
€ach subject on each half. The correlation of one half with the other is a 
Measure of split-half reliability. Finally, he may construct two equivalent or 
Parallel forms of a test and administer both forms to the same group of 
Subjects, The correlation of one form with the other is a measure of 
equivalent-forms reliability. 
It can be shown that a coefficient of reliability comp 
differences with intraindividual differences. If we express an 


lications of product-moment 


ares interindividual 
individual’s 
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standing on a test as a deviation from the mean of his group, then his average 
standing for two administrations can be computed by adding the two devia- 
tions and dividing the sum by two. Differences among these average standings 
for the individuals in a group might be called interindividual differences. If 
we measure the variability of the individual's two standings around his 
average standing and combine these measures for all subjects, the result 
might be called a measure of intraindividual differences. When intraindividual 
differences are small, one can say that the measures are reproducible. When 
intraindividual differences are large, the measures are lacking in reproduci- 
bility. Of course, these differences are “ small " or “ large " relative to some 
standard. In expressing reliability by means of a correlation coefficient, the 
psychologist compares the two kinds of differences. Thus intraindividual 
differences are small or large by comparison with interindividual differences. 
Small, when the reliability coefficient is large. Large, when the reliability 
coefficient is small. 

Validity has a number of different meanings for the psychologist. Here we 
shall use the term to refer to concurrent validity and predictive validity, both 
of which are often assessed by correlating a predictor variable and a criterion 
variable. Concurrent validity involves criterion measures obtained at the 
same time as the predictor measures. Predictive validity involves criterion 
measures obtained some time after the predictor measures. Computing the 
product-moment coefficient of correlation for a test and a criterion is 
the most common method of demonstrating the predictiveness of the 
test. 

The student should review the discussion of the functional approach to 


confirmation in Chapter 5 and integrate that discussion with the comments 
on reliability and validity made above. 


Many psychological tests and scales consist of items which are scored as 
dichotomous variables. Subjects’ responses to an item can be classified into 
two categories, such as correct and incorrect, favorable and unfavorable, 
affirmative and negative, true and false, and so on. An individual's responses 
in one category are counted while those in the other category are not counted. 
Thus an individual's score is often simply a frequency count of items. It is 
possible to treat scores for a given item as a dichotomous variable and to 
correlate them with scores for another item, with scores on the test containing 
the item, and with measures on some criterion. The complex of issues sur- 
rounding item selection based on these correlations is beyond the scope of the 
present text. We would like only to point out that formula (7.12) could be 
employed to advantage in computing an item-test correlation or an item- 


criterion correlation, and that formula (7.13) could be used to compute the 
correlation between two items. 


*The logical property of product-moment co 


es F. rrelation, which supports this interpretation, 
is given in Chapter 10. 
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Closing Comment 
» 


In the frequent practical use of prediction equations and in the common 
listing of correlation coefficients in research reports, there are implicit two 
fundamental notions which deserve considerable emphasis in the training of 
students. The first is that prediction and correlation actually constitute a 
functional interpretation of relations among variables. The concept of a 
function is central to the entire development. The second notion is that func- 
tional interpretation through prediction and correlation involves the fitting 
of lines and curves to data which often display much irregularity. Even though 
the irregularity makes fitting necessary, the function concept is still there. It 


is fundamental. 


THE ANALYSIS OF VARIABILITY 
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Classification procedures play an important part in statistical analysis. Two 
kinds of classifications can be distinguished. One kind, the frequency classi- 
fication, includes univariate frequency distributions and bivariate frequency 
distributions, which have been discussed in earlier chapters. In a univariate 
frequency distribution, a set of numbers is represented by categories which 
correspond to intervals on a quantitative scale and by frequencies which 
indicate how many numbers in the set fall into each category on the scale. 
In a bivariate frequency distribution, two sets of paired numbers are repre- 
sented by categories on two scales and by frequencies which indicate how 
many pairs of numbers are in each possible combination of categories, one 
from each scale. Later, frequency classifications in which the categories are 
qualitatively distinguishable will be considered. 


The second kind of classification does not require the use of frequencies. 
Numbers in a set are classified according to the categories to which they 
belong. The categories are sometimes designated qualitatively, sometimes 
quantitatively. In either case, the variability of the numbers can be analyzed 
into components. Classifications of this second kind vary in complexity. We 


Shall consider only two, fairly simple classifications and the analysis of 
variability appropriate to each one. 


Classification by Columns 


Numbers in a set can be classified into subsets and the variability of the 
entire set can be analyzed into two components. 
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Classification by Columns tes 
Let 


n = number of values per subset, a constant; 
c  — number of subsets; 
N = cn = total number of values in the set; 


Yi, = the it value in the kt subset. 
If we let a subset be indicated by a column in a rectangular arrangement of 
the N numbers, the general form of the arrangement is that of Table 10.1. 


Table 10.1 
NUMBERS CLASSIFIED BY COLUMNS 


Columns 
1 2 3 k è 
Yu Yi Nis Yu Vic 
Yu Yoo Yas Yor Ys 
Yu Yoo Yas Yor Ysc 
Ya Yis Yis . Yir : Yi 
Yn 1 Yao Yna ii Y, k * Yne 


Note: There are no rows in the table and the numbers are not, therefore, classified by 
rows. 


The sum of the n values in the k'^ subset or column can be written 
n 
X Ya. 
i 


Notiee that the index, i, governs the summation, which involves terms from 
"s , * H > 
1” to “n” within any column. 


Y., the mean of the n numbers in the kt column, is given by the formula, 


Ž Ya 
Ya s 


The sum of all N values can be written as the sum of the c column sums, 
con 
ZZ Vin 
k i 


The purpose of the double subscript should now be apparent. The index, i, 
Boverns the inner sum which is restricted to the values within a column. The 
Index, k, governs the outer sum which contains the c column sums. It will be 
convenient to use the symbol T, for the sum of all N values. Then 


ca 
T, = the sum of the column sums = A 2 Ys. 
i 
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Let M, = the mean of all N values. Then 


The Analysis of Variability 


The variability in the entire set of numbers can be measured by the sum of 
squares of deviations from the total mean, M,. The sum of squares of devia- 
tions from M, for the numbers in the kt" column is 


È (Ya — MX 


Then the sum of squares of deviations from M, for all c columns is 


2 EY. — My, 
ko. 


which quantity will be called the total sum of squares or the total variability. 
If we expand and simplify this expression, making appropriate substitutions 


as well, a computing formula can be obtained. The formula is 
e mn T2 
ZZY- =. 
EE ik N 


Let C, = T?/N. Then the formula for the total sum of squares can be written 
ES v3, — C. (10.1) 
k i 

Example: 


In Table 10.2 are 15 numbers classifie 
consisting of five numbers. The column s 
are given at the bottom of the rectangul 


d into three subsets, each subset 
ums, means, and sums of squares 
ar arrangement. 


Table 10.2 


AN EXAMPLE OF NUMBERS CLASSIFIED BY COLUMNS 
Columns 


Sums 35 40 60 
Means 7 8 12 


Sums of squares 259 330 738 
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Computation of the total sum of squares requires the sum of the N values 
and the sum of their squares. We find that 
Ty = 135 
and 


È È Yh = 1,327. 
k T 


Having found T,, we can compute Cy, the second term in formula (10.1), 
as follows: 
T3 135% 
C= WW 1,215. 


The total sum of squares of deviations is found by the subtraction, 
1,327 — 1,215 = 112. 
nto two additive components. 


combined within-column sum 
tion and computation of this 


The total sum of squares can be subdivided i 
The first of these components we shall call the 
of squares. We proceed below with the defini 
component. 

Consider the kt column. The sum of squares o 
of the kth column is 


f deviations from the mean 


n 5 
3 (Ya — Fa), 

i 

hat column. Formula (4.2), which was 


derived earlier for computing the sum of squares of deviations from the 
mean for a set of numbers, can be rewritten for a column sum of squares, 
With some slight modifications in the symbolism, as follows: 


a measure of the variability within t 


: d Ya? 
A Ye — n 
Example: 
e first column of Table 10.2 is 


The sum of squares of deviations for th 
computed, as follows: 


85* 
IE A 
259 5 1 
The sum of squares for the second column is 
40? 
380 epo 10. 


For the third column, it is 
2 


0 
788 ——— 18. 


1l 
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Each of the three quantities computed above is a measure of the variability 
within a particular column. The three values can be added, yielding the com- 
bined within-column sum of squares, an overall measure of the within-column 
variability, as indicated by the expression, 


2 X(Y. — Y4y, 
k i 
and by the computation, 


14 + 10 + 18 = 42. 


Observe that the combined within-column sum of squares, which is 42, is 
less than the total sum of squares, which is 112. Measurement of the variabi- 
lity within columns has not taken account of all of the variability in the entire 
set. We proceed now with the definition and computation of another 
component of the total variability. 

The variability within any column can be eliminated by substituting the 
mean of the column for each of the n values within that column. It will be 
instructive to do so. Table 10.3 shows the results of substituting the mean of 
each of the three columns of Table 10.2 for each of the five values in a 
column. Examination of the values in Table 10.3 reveals that the variability 


Table 10.3 


VARIABILITY REMAINING AFTER ELIMINATION OF 
WITHIN-COLUMN VARIABILITY IN TABLE 10.2 


Columns 


has indeed been eliminated within each column, but that some variability 
does remain. It seems appropriate to call the remaining differences in Table 
10.3, between-column variability. The between-column variability can be 
measured by a sum of squares computed from the values in Table 10.3. The 
mean of the 15 values in the table is M, = 9, the mean of the original values 
in Table 10.2. A deviation from M, for a value in the first column of Table 
10.3 is 

(7 — 9). 
The square of this deviation is 

(7 — 9». 
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Observe that all five squared deviations for values in the first column are the 
same. Therefore, for the first column, the sum of squares of deviations from 
M, can be written as the product, 


5(7 — 9}. 

For the second column, the sum of squares of deviations from M, is 
5(8 — 9). 

For the third column, it is 
5(12 — 9y*. 


Combining these three quantities as a measure of the between-column 
variability yields 

5(7 — 9)? + 5(8 — 9)? + 5(12 — 9)? = 70. 
These numerical expressions, which combine to yield the between-column 
Sum of squares, can be written symbolically, as follows: 


n(Y, — Mj? + n(Y.4 — Mj? + n(Y. — My)? 
or 


«Y, — My 
k 
or 


n È (Fa — M}. 
k 


If this expression for the between-column sum of squares is expanded and 
simplified, with appropriate substitutions, the result is a convenient computing 
formula: 


dE. 5. os (10.2) 


Formula (10.2), representing the between-column sum of squares, can be 
translated into verbal instructions, as follows: Find the sum of each column. 
Square each column sum. Sum the squared column sums and divide the result 
by n, the number of values per column. Compute C, — T?/N, where T, is 
the sum of all N values. Subtract C, from the quotient obtained just 
Previously, 

The derivation of formula (10.2) is based on the assumption that n, the 
number of values per column, is a constant. When n varies, the instructions 
ate modified in the following respects: Each squared column sum is divided 

Y the number of values in that column. The c quotients which result are 
then summed, Finally, the correction term, C, is subtracted. 

The between-column sum of squares for the numbers in Table 10.2 was 
Computed from the numbers in Table 10.3 and found to be equal to 70. We 


190 The Analysis of Variability 


shall demonstrate the use of formula (10.2) in computing the same quantity. 
The column sums from Table 10.2 are 35, 40, and 60; n is 5; C, is 1,215. 
Substituting these values in formula (10.2), we obtain 
35* + 40* + 60* 
5 


1,215 70. 


Summary 


Having been given a set of numbers classified by columns, we set out to 
analyze the total variability into two components. We found the total sum 
of squares of deviations from M, to be 112. For each column, we computed 
the sum of squares of deviations from its mean. We then combined the three 
values, 14, 10, and 18, to obtain a measure of the combined within-column 
variability, a sum of squares equal to 42. Finally, we eliminated the within- 
column variability by substituting the mean of a column for each value in that 
column. The remaining variability was called between-column variability. For 
these substituted values, the sum of squares of deviations from M , was found 
to be 70. 

Notice the relation between the total sum of squares and the two com- 
ponents: the between-column sum of squares and the within-column sum of 


squares. In the example, 
112 — 70 4- 42. 


That is to say, the total variability is equal to the between-column variability 
plus the within-column variability. This relation holds true in general. The 
example given above does not establish the generality of the relation, but it 
can be established. The proof will not be given here. 


On Classifications 


Let C, be the kt" category in a set of c categories into whieh a set of N 
numbers can be classified with n, numbers per category. In the classification 
scheme of the preceding section, the categories corresponded to columns in the 
rectangular tabulation. A classification involving one set of categories is 
sometimes called a single classification. There is nothing to prevent our 
classifying the same set of N numbers into another set of categories. 

Let R; be the jtt category in the second set of r categories. Classifying the 
N numbers simultaneously into both sets of categories will be defined here 
as distributing them into new categories specified by the symbol combination, 


Bi s 


i 
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That is, a number is classified into a new category because it belongs to the 
j' category in the R set and the kt category in the C set. The number of new 
categories is the number of possible combinations of R; and Cp, which is 
(r X c). The simultaneous classification can be represented by a rectangle of 
r rows, c columns, and (r x c) cells. The category R;C; is the cell defined by 
the intersection of the jt" row and the kt column. It is sometimes convenient 
to refer to such a classification as a double classification or as an (r X c) 
classification. 

Table 10.4 illustrates the principle of doubly classifying numbers. Columns 


Table 10.4 
SIMULTANEOUS CLASSIFICATION ON TWO SETS OF CATEGORIES 


First Set 
Ci C; C; š C; 1 (eA 


Second 
Set 


and rows of the table are appropriately labeled. Each cell contains a com- 
bination of symbols indicating the combination of categories represented by 
that cell. Symbols in the cells of Table 10.4 are not to be interpreted as 
numbers, The symbols identify the new categories. The table only serves the 
purpose of making explicit the way in which numbers can be classified 
simultaneously with respect to two sets of categories. 


Variability in a Double Classification 


When numbers in a set are classified in two different ways simultaneously, 
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as shown in Table 10.5, the variability of the entire set can be analyzed into 
three components. 


Table 10.5 


SCHEME FOR DOUBLY CLASSIFYING NUMBERS 
WITH ONE NUMBER PER CELL 


Columns 


Rows 


Let N — total number of values in the entire set; c — number of columns 
— number of subsets or categories in the first way of classifying; r — number 
of rows — number of subsets or categories in the second way of classifying. 
Then (r x c) — number of cells — number of combinations in the simul- 
taneous classification. 

Let » = number of values per cell — number of values in each new 
category = 1,1 

If we let Y, be the value in the jth row and the kth column, then the sum 
of the r values in the &'^ column can be written 


EX. 
y 


The mean of the r values in the kt column is 


T,, the sum of all N values,can be written as the sum of the c column sums. 


"Double classifications in which n > 1 will not be discussed here. 
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That is, 


The mean of all N values is 


T, T, 
M = N rc 
The sum of the c values in the jt" row can be written 
[4 
Z2 Y. 
k 
The mean of the c values in the jt" row is 
c 
È Yn 
"n E 
j. c 


The sum of all N values was written above as the sum of column sums. It 
can be written as the sum of row sums, as follows: 


r c 
Ty = A A Yu 
y 
The total variability exhibited by the N numbers is defined by either of two 
equivalent expressions: 


r 


È È (Yn — M 
k j 

or 
5 È (Yn — M} 
j k 


Thus in defining the total sum of squares it makes no difference whether one 
sums the r squared deviations in each column and combines these c quantities, 
Or sums the c squared deviations in each row and combines the r resulting 
quantities, One obtains the same result for the total sum of squares by both 
methods. 

Either one of these expressions for the total sum of squares can be 
€xpanded and simplified. If appropriate substitutions are then made, a 
Convenient computing formula results: 


EEyn-c, (10.3) 
k j 


or 


where C, = T2/N. 
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Example: 
Table 10.6 contains a 3 x 4 classification of numbers with one number per 
Table 10.6 
EXAMPLE OF A DOUBLE CLASSIFICATION OF NUMBERS 
Columns 
1 2 3 4 Row Row 
Sums Means 
1 28 (à 
Rows 2 32 8 
2 12 3 
Column Sums 18 24 15 15 
Column Means 6 8 5 5 
Sums of Squares 
of Original Values 134 200 101 83 


cell. Thus r = 3; c = 4; and N =r x c = 12. The sums and means of rows 
and columns are arranged on the margins of the table. The sum of squares 
of the values in each column is also given below the column. 

Computation of the total sum of squares of deviations by means of 
formula (10.3) requires the sum of squares of all N original values and the 
quantity, C,. The sum of squares of the original values is 

134 + 200 + 101 + 83 = 518. 


The other required value, C, = T2/N, is computed, as follows: 


722 
C, = B~” 432 
Substitution of these values in formula (10.3) yields 
518 — 432 = 86, 


which quantity is the total sum of squares of deviations from M ,. 

The variability within each column of Table 10.6.can be eliminated by 
substituting the mean of a column for each value in the column. Likewise, 
the variability within each row of the table can be eliminated by substituting 
row means. The results of both of these substitution procedures are shown in 
Table 10.7. The variability remaining after the within-column variability has 
been eliminated can be seen in arrangement A. We shall refer to what remains 
as between-column variability. The between-column sum of squares of 
deviations is defined by the expression, 


EY, — My, 
k 
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Table 10.7 
ELIMINATION OF WITHIN-COLUMN AND 
WITHIN-ROW VARIABILITY IN TABLE 10.6 , 


A. Results of substituting column means: 


6 8 5 5 
6 8 3 5 
6 8 5 S 


7 7 7 7 
8 8 8 8 
3 3 3 3 


from which can be derived a convenient formula, 


e r 
Z(Z Yay 
e ees (10.4) 

Formula (10.4) should be compared with formula (10.2). The former is 
simply an adaptation of the latter. Formula (10.4) can be translated into 
Verbal instructions, as follows: Find the sum of each column; square each 
column sum; combine the squared sums and divide by r, the number of rows; 
Subtract C, = T2/N from the resulting quotient. 

Example: 
. The computation of the between-column sum of squares for the numbers 
in Table 10.6 (or for those in arrangement A of Table 10.7) requires the 
Substitution of column sums, a value for C,, and a value for r in formula 
(10.4), as follows: 

18? + 24? + 15? + 15? 
3 us 

The variability remaining in Table 10.6 after the within-row variability has 
been eliminated can be observed in arrangement B of Table 10.7. We shall 
Tefer to this remaining variability as between-row variability. The between- 
TOW sum of squares of deviations is defined by the expression, 


432 18. 


È (F; .— My), 
J 
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from which can be derived a convenient computing formula, 


r [4 
EX YQ 
ME REM is. (10.5) 

Formula (10.5) should be compared with formulas (10.4) and (10.2). The 
common features of the three formulas should be apparent. Formula (10.5) 
can be interpreted as giving the following instructions: Find each row sum; 
square each row sum; combine these squared sums and divide by c, the 
number of columns; subtract C, = 7T ?/N from the quotient. 

Example: 

To compute the between-row sum of squares for the numbers in Table 10.6 
(or for those in arrangement B of Table 10.7), one substitutes values for the 
row sums, C,, and c in formula (10.5), as follows: 


28? + 32? + 12? 
4 


With respect to the numbers in Table 10.6, we have established that the 
total sum of squares is 86, the between-column sum of squares is 18, and the 
between-row sum of squares is 56. Observe that the sum of the two com- 
ponents is less than the total. There is some remaining variability, for which 
we have not yet accounted. What remains is called the residual variability. 

The residual sum of squares is defined, as follows: Let rj, be a residual 
deviation for the cell in the jtt row and the kt column. Then the sum of 
squares of the N residuals is 


432 — 56. 


e r 

2 
EE. 
k j 


A residual deviation, r;,, is defined as 
; 200 Og — My) — (OF; — My) — (F4 — My). 
That is, rj, is what remains of the deviation, (Y;, — M,), after subtraction 
of the deviation, (Y;. — M ,), and the deviation, (Y.,, — M,). 
When the terms in the defining expression are combined, one obtains 


Vi = Yin — Yj. — Ye + My (10.6) 


Example: 
The residual deviations for the numbers in Table 10.6 are displayed in 
Table 10.8. The computation of four of the twelve residual deviations is given 


Table 10.8 


RESIDUAL DEVIATIONS FOR 
NUMBERS IN TABLE 10.6 


0 +1 0 ex 
+1 —2 +1 0 
=1 +All =l Fd 
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below. In the computation, values from Table 10.6 were substituted in 
formula (10.6). The student should verify the computation of the other eight 
residuals. 

rı= 71—7—6+6= 

F21 

lai 2—3—6-6 

rye = 10—-7—8+4+6 t 


The residual sum of squares is the sum of squares of the twelve values in 
Table 10.8: 


The total variability of the numbers in Table 10.6 has now been analyzed 
into three components. The total sum of squares is 86; the between-column 
sum of squares is 18; the between-row sum of squares is 56; the residual sum 
of squares is 12. Notice that the total is equal to the sum of the three com- 
ponents. That is, 


86 = 18 + 56 + 12. 


The additivity of these three components is an example of a relation whose 
generality will not be proved here. It can be shown that the total sum of 
squares is equal to the sum of the three components: the between-column 
sum of squares, the between-row sum of squares, and the residual sum of 
squares. The example itself does not, of course, constitute a proof of the 
generality of the relation. 


Summary 


Two ways of analyzing the variability in a rectangular table of numbers 
have been described in the present chapter. The first method of analysis was 
applied to a single classification of numbers. The total sum of squares was 
subdivided into two components: the between-column sum of squares and 
the within-column sum of squares. It was observed that the two components 
were additive, that is, they combined to yield the total. The second method 
of analysis was applied to a double classification of numbers. The total sum 
of squares was subdivided into three components: the between-column sum 
of squares, the between-row sum of squares, and the residual sum of squares. 
The three components combined to give the total. 

. Two principles of analysis were stated. The first was that the total variabi- 
lity in a single classification of numbers can be analyzed into two additive 
components: the between-column sum of squares and the within-column sum 
of squares. The second was that the total variability in a double classification 
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of numbers can be analyzed into three additive components: the between- 
column sum of squares, the between-row sum of squares, and the residual 


sum of squares. 


Double Classification and Correlation 


Product-moment correlation was discussed in Chaptei 7 with reference to 
a double classification of numbers, although the tabulation was not so desig- 
nated. Table 7.1 consisted of N rows and two columns with one entry per 
cell. The columns corresponded to two sets of numbers, X and Y. The rows 
corresponded to pairs of numbers, a pair consisting of one number from each 
set. In a table of r rows and two columns, the correlation between the columns 
can be expressed in terms of the between-row variability, the residual 
variability, and the variability within the columns. A connection is thereby 
established between correlation and the analysis of variability for classifica- 
tions of numbers. 

To emphasize the connection between correlation as a measure of con- 
comitance for paired values of X and Y, and the analysis of variability for 
anr x 2 classification of numbers, we shall employ two different systems of 
symbols for the same quantities. 

Let ria = the correlation between the columns of an r x 2 classification 
of numbers — r,, — the correlation between two sets of paired numbers, 
X and Y. 

Let B = the between-row sum of squares for an r x 2 classification = the 
between-pair sum of squares for variables X and Y. 

Let R — the residual sum of squares for an r x 2 classification — the 
residual sum of squares for the N x 2 table of X and Y. 

Let W, — the sum of squares for the variability within the first column of 
ther x 2 classification = W, = the sum of squares of deviations for X. 

Let W, = the sum of squares for the variability within the second column 
of the r x 2 classification = W, = the sum of squares of deviations for Y. 

It can be shown that the sum of products of deviations for the arrays of 
either arrangement is equal to (B — R)/2, from which it follows that 


rh = e (10.7) 
24W,W, 
and that 
B—R 
Fey = TTL (10.8) 
24W,W, 


Formula (10.7) states that the correlation between the columns of an r x 2 
classification is a function of the between-row sum of squares, the residual 
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sum of squares, and the variability within each column. Formula (10.8) states 
that the correlation between X and Y is a function of the between-pair sum 
of squares, the residual sum of squares for the N x 2 table, and the sums of 
squares for Y and Y. Imagine that the W/s are fixed values in both formulas. 
It follows that r}, and r,,, will increase in algebraic value as B increases and 
R decreases. When 


BR, 
the correlation is positive; when 

B=R, 
the correlation is zero; when 

PZR 


the correlation is negative. 


Pragmatics and the Analysis of Variability 


Although methods of analyzing variability into component sums of squares 
can be applied to any table of numbers having the specifications set forth 
early in the present chapter, these methods of analysis are seldom used for 
purely descriptive purposes. They are, however, widely used in statistical 
inference, especially in the analysis of experimental data. Since the descriptive 
uses of these methods are of little importance, no discussion of them will be 
undertaken in this text. The very important role of these methods of analysis 
in statistical inference, in connection with the evaluation of research data, 
will be treated in Chapter 15. At that time it will be necessary to take up the 
computation of variances from the component sums of squares. 

There is one topic in descriptive statistics which merits comment at this 
Point. In Chapter 9, the use of product-moment correlation as a measure of 
reliability was presented. In that connection, an allusion was made to an 
Interpretation of the coefficient in terms of interindividual variability and 
intraindividual variability. It is this interpretation which deserves additional 
comment. 

In the earlier discussion of Chapter 9, it was stated that the product- 
moment correlation coefficient compared interindividual variability, the 
differences among the average standings of the subjects tested on two occa- 
Sions on the same test, with intraindividual variability, the difference in 
Standings for each subject on the two occasions.* This comparison is made 
explicit by formula (10.8). B stands for the between-pair sum of squares, 


C TP 
of a individual's “ standing " was defined as the deviation of his score from the mean 
€ group on a given occasion. 
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which is a measure of differences among average standings. R stands for the 
residual sum of squares, which combines measures of the variability in the 
two standings for each one of the N persons tested. Because the correlation 
can be expressed in terms of the comparison of interindividual variability and 
intraindividual variability, psychologists often refer to a reliable test as one 
which reveals large individual differences, or as a very discriminating test, 
that is, one which discriminates among respondents. 


SAMPLING THEORY 


Syntactics | 11 


The most important applications of statistics involve what is known as 
statistical inference. In evaluating research data by the methods of statistical 
inference, one attempts to go beyond the mere description of a particular set 
of data with its unique temporal and spatial context, to take account of error 
in the data and in values computed from the data, and to arrive at proposi- 
tions of some more general and transcendental meaning. These statistical 
inferences from the particular to the general are not always correct, but, under 
certain conditions, they endow the evaluation with a rigor which has not yet 
been, and perhaps never will be, equaled by any other means. 

Underlying the applications of statistical inference is a body of theory 
concerning error. The syntactical features of this body of theory should be 
made explicit before any attempt is made to use the theory. In the present 
Chapter, we shall try to sketch the principal features of a basic part of this 
theory, a part which might be called sampling theory. In doing so, many 
detailed developments which would ordinarily be covered in a course in 
mathematical statistics will be omitted.! It is our intention here to attempt to 
Convey to the student some understanding and appreciation of the nature of 
the theory of random errors, by means of a very general and quite elementary 
formulation. 

In addition to the underlying theory of random errors, there are certain 
Considerations which are important in supporting and guaranteeing the rigor 
Of statistical inferences. This additional set of considerations is semantical in 
nature and has to do with (1) the adequacy of measurement, a topic which 
has already been discussed in Chapter 5, and (2) the adequacy of the opera- 
tions by means of which an investigator tries to realize in practice the 
requirements of the theory with respect to the nature of errors. 

"Students who wish to consult a standard, detailed text in mathematical statistics might 


nine R. V. Hogg and A. T. Craig, Introduction to Mathematical Statistics (New York: 
acmillan, 1959). 
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Chapter 12 will present in summary form some of the major inferential 
statistics with definitions and brief descriptions of their logical properties with 
respect to error. In Chapter 13, the semantical issues of statistical inference 
will be discussed. 

Let us begin our discussion of sampling theory with a frequency distribu- 
tion for variable X. We shall refer to this frequency distribution as a popula- 
tion or parent distribution because we plan to form samples out of values 
chosen from the distribution. Let X; be the value of X in the jt" score interval 
of the population. Let f; be the frequency of occurrence of X; in the popula- 
tion. Further, let r be the number of distinct and equal score intervals in the 
distribution. Finally, let N equal 2f, the total number of values in the 
distribution. The distribution is shown in Table 11.1. 


Table 11.1 
A POPULATION DISTRIBUTION 
X f 
x Ai 
Xa f 
X Í 
X; fj 
X, " 


r = number of distinct score intervals. N = Zf; = total number of values. 


We propose next to adopt a system for forming combinations of values 
taken from the parent distribution. Any combination of values resulting from 
the application of this system will be called a random sample. The system of 
forming combinations, although it may seem arbitrary and artificial to the 
student, makes possible extremely powerful deductive procedures. In the 
system, we shall follow the rule that, if we wish to form one or more com- 
binations of values taken from the population distribution, there is no 
restriction on the repeated use of any one value.? For example, if we desire 
to form several combinations of two values, any one of the pairs of values in 
Table 11.2 is allowed by the system. We shall say that giving the particular 
values in a combination and the order of their selection identifies a way of 
choosing a combination. Thus the pairs listed in Table 11.2 are examples of 
different ways of choosing combinations of two values. 

It is worth noting that all possible different ways of choosing two values are 
defined by the (r x c) square matrix of Table 11.3. In the matrix, X; is the 
first value chosen for the pair and Y; is the second value. Note that the values 
of the parent distribution are arranged along both margins. In this instance, 


?In another system the repeated use of any single value is not allowed. 
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r — c and the matrix is square. In interpreting the matrix, we let the inter- 
section of the jt" row and the kt column represent a way of choosing two 


Table 11.2 


EXAMPLES OF PAIRS OF VALUES ALLOWED BY THE SYSTEM 
FOR SELECTING FROM THE POPULATION IN TABLE 11.1 


First Choice Second Choice 
Pf Xs 
Xs Xx 
Pairs or x Xa 
Ways of Xs x 
Choosing Xa X; 
x X 
Pf X. 
Xs Xs 


values, X; and X,. In the diagonal cells running from the upper left corner 
to the lower right corner are combinations for which j = k and X; = Xp 


Table 11.3 


ALL POSSIBLE DIFFERENT WAYS OF CHOOSING TWO VALUES 
FROM THE POPULATION DISTRIBUTION IN TABLE 11.1 


Second Choices 


First 


Choices 


Note: In this instance, r = c and the matrix is actually square. 


us matrix is symmetrical around the diagonal in the sense that the com- 
SUC, for the second row and the third column involves the same values 
d iene for the third row and the second column, but chosen in 
huis eia To represent all possible different ways of choosing com- 
ree values, a figure of three dimensions, that is, a cube, would 
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be required. For combinations of n values, an n-dimensional arrangement 
would be needed. 

The cells of the square in Table 11.3 define all possible different ways of 
choosing combinations of two values. A way of choosing two values is 
identified by listing the values in the order of their selection. When there are 
r distinct values in the population, the number of different ways of choosing 
combinations of two is r?. 

There is an important distinction which should now be made. The dis- 
tinction involves the number of ways of choosing and the total number of 
possible combinations. The total number of possible combinations of two 
values is N?, a conclusion reached by observing that there are N possible 
first choices and, for each of these, N possible second choices. 

To determine all possible combinations of two values, we must take account 
of the frequency in each interval of the scale of X. Consider the cell in the 
second row and third column of Table 11.3. It represents the combination, 
X, chosen first and X, chosen second. We have said that X, occurs with a 
frequency of f, and X occurs with a frequency of f, in the population. The 
number of possible combinations of X chosen first and X, chosen second is 
(fa X fa). The reasoning which supports this conclusion is simple. There are 
fa possibilities in choosing X, first; for each of these, there are f; possibilities 
in choosing Y, second. It follows that the total number of these combinations 
is (fs X fy). 

Table 11.4 gives the number of possible combinations for each way of 
choosing two values. 


Table 11.4 
NUMBER OF POSSIBLE COMBINATIONS PER WAY 
OF CHOOSING TWO VALUES 


Second Choices 


fk 
f 
fs 
Ss 
First 
Choices 
fj 
fe 


Note: In this instance, r = c and the matrix is square. 
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To summarize, the number of different ways of choosing two values is r°; 
the number of possible combinations of two for any way of choosing is 
Fifi; the total number of possible combinations of two is N*. In general, the 
number of different ways of choosing combinations of n values is r”; the 
number of possible combinations for any way of choosing is the product of 
n marginal frequencies; and the total number of possible combinations is N". 

The frequencies in the cells and on the margins of Table 11.4 are often 
expressed as relative frequencies. In the population and in each of the 
corresponding marginal distributions, p;, the relative frequency for the j'^ 
interval, is defined as 


at 
Pi N 

and pp, the relative frequency for the Kt^ interval, is defined as 
Pk = f. 


In the square matrix of Table 11.4, the relative frequency for the cell in the 
J™ row and the kt® column is 
Sife hf 
Pik = pe = NN PPE 
Observe that the relative frequency for a cell is equal to the product of the 


two relative frequencies on the margins. Observe also that 


2 pi =], 


Zp =l, 
and 
c r 
X pj, = 1. 
kj 


In further discussions of the population distribution, the corresponding 
Marginal distributions, and the distribution of combinations, we shall employ 
relative frequencies. 


Distribution of the Sample Mean 


Any combination of n values chosen from the population distribution in 
Table 11.1 according to the strictly logical system outlined above is, by 
definition, a random sample. Any value computed from the values in a 
random sample is an j-statistic. It is of very great importance that a frequency 
distribution can be deduced which will specify the values taken by a given 
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i-statistic for all possible random samples of a particular size. The frequency 
distribution of values of a given i-statistic is called the sampling distribution 
of that i-statistic. Of course, many different i-statistics can be computed from 
the values in a sample. For demonstration purposes, the sample mean will do 
quite well and we shall use it to show the deductive process by means of 
which a sampling distribution can be established. 

Let us again work with samples of two values. For any sample, the sum of 
the two values in that sample will be 

X; + Xs 
and the sample mean will be 
E L4. 
wc 2 
Since M, is computed from the values in a random sample, it is an inferential 
statistic or an i-statistic. 

We now propose to establish the sampling distribution of Mz. The sampling 
distribution is a relative frequency distribution for values of M, computed 
from all possible random samples of two values from a given population. To 
arrive at the sampling distribution of M,, we first determine the values M; 
can take for all the different ways of choosing samples of two values. We next 
combine the relative frequencies for each different value of M,. These com- 
bined relative frequencies can be displayed in a new distribution showing the 
relation between relative frequency and values of Mz. 

An example may help in clarifying the procedure of establishing the 
sampling distribution of M,. Table 11.5 contains a simple, three-interval 


Table 11.5 
AN EXAMPLE OF A POPULATION 


x p 

0 0.50 
1 0.20 
2 0.30 


frequency distribution which will constitute the population in our example. 
Although the distribution possesses only three score intervals, the values of 
X are small, whole numbers, and the relative frequencies have the values 
shown in the table, it qualifies as a population and will be quite satisfactory 
for the example. Keep in mind that, syntactically speaking, a population is 
nothing more than a frequency distribution. 

In the example, r = 3 and r? = 9. The nine different ways of choosing 
combinations of two values of X are specified by the entries in the nine cells 
of Table 11.6. The first choice is placed on the left in each cell; the second 
choice is placed on the right. The relative frequency for each cell is the pro- 
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Table 11.6 


WAYS OF CHOOSING TWO VALUES FROM 
THE POPULATION OF TABLE 11.5 


In each cell, the first choice is on the left, the second is on the right. 


duct of the marginal relative frequencies. Relative frequencies for the margins 
and the cells are displayed in Table 11.7. The value of M, for the combination 


Table 11.7 
RELATIVE FREQUENCIES FOR WAYS 
OF CHOOSING TWO VALUES 


0 1 2 — dE 
S50 20 30 De 


X; hi 


in each cell has been computed and is shown in arrangement A of Table 11.8. 
In arrangement B of the same table, cells containing the same value of M; 


Table 11.8 
MEANS OF SAMPLES OF TWO VALUES 
A 
Xe 
0 1 2 
0 
Xp 1 
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have been assigned a common label from the letters: A, B, C, D, and E. The 
combined relative frequency for each different value of M, is the sum of the 
relative frequencies in cells having a common label. If we let the labeling 
letters be variables representing relative frequencies, then for M, = 0, the 
relative frequency is 4 = .25; for M, = .5, the combined relative frequency 
is2B = .10 + .10 = .20; for M, = 1, the combined relative frequency is 
XC =.15 + .04 + .15 = .34; for M, = 1.5, the combined relative frequency 
is YD = .06 + .06 = .12; and finally, for M, = 2, the relative frequency is 
E = ,09. 

If we let P be the relative frequency of any value of M,, then the sampling 
distribution of M, for samples of two values is the relative frequency dis- 
tribution of Table 11.9. The sampling distribution of M, gives the distribution 
of all possible values of M, for samples of a specified size. 


Table 11.9 
AN EXAMPLE OF A SAMPLING DISTRIBUTION 


Mz P 
0 .25 
5 20 

Li 34 

145 12 

2. .09 


M,, computed from samples of two values from the population in Table 11.5. 


The reasoning which entered into the determination of the distribution of 
M,, given in Table 11.9, is a very simple example of deduction in sampling 
theory. Although the extensions and elaborations of deduction in sampling 
theory go far beyond this simple example and sometimes involve advanced 
and difficult mathematics, all theoretically deduced sampling distributions 
serve basically the same purpose. The sampling distribution of any i-statistic 
is simply a specification of the relative frequencies of the totality of possible 
values for the particular /-statistic. 

The i-statistic in the example given above was a mean computed from a 
single sample. There are, of course, many other i-statistics. Sampling dis- 
tributions have been established for values, other than the mean, computed 
from a single sample, for values computed from a single sample from a bi- 
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variate or multivariate distribution, for values computed from two or more 
samples, and for values computed from samples classified in a great variety 
of ways. 

The deductive procedures employed in obtaining the sampling distribution 
in Table 11.9 are applicable to discrete population distributions. Discrete 
distributions are those having a finite number of score intervals. The deductive 
procedures of advanced work in sampling theory differ from those of the 
example in three respects. First, the population distribution is continuous; 
that is, r, the number of score intervals, is indefinitely large. Second, the 
relative frequencies of the population distribution are specified by an equation 
for a continuous function. Third, the relative frequencies of the sampling 
distribution are specified by an equation for a continuous function. 

The general principle we have been trying to communicate is summarized 
in the following conceptual formula: A specified population distribution plus 
a standard deductive procedure yields the sampling distribution of an 
i-statistic. 

None of the deductive procedures of advanced work in sampling theory 
Will be given here because these topics are appropriate to courses in mathe- 
matical statistics. What are important to psychological statistics and to the 
use of statistics in psychological research are the results of the deductions, the 
sampling distributions. In Chapter 12, the sampling distributions of a number 
of the more important i-statistics will be identified and the general nature of 
the information we need about these sampling distributions will be described. 


Exercise 11.1 


Given in Table 11.10 is a discrete population distribution. Obtain the sampling 


Table 11.10 
A POPULATION DISTRIBUTION 


X p 
0 20 
1 .50 
2 .20 
3 10 


distribution for the mean of random samples of two values. What steps would be 
necessary to extend the procedure to samples of three values? 


SAMPLING 
DISTRIBUTIONS 


Syntactics | 12 


In deducing the form of a sampling distribution, one must begin with a | 
specification of the population distribution. A vast part of sampling theory | 
is concerned with the drawing of random samples from a population dis- 
tribution in which the relative frequency of X is specified by a normal 
probability function or normal curve. Although there are many other 
functions or curves which could serve as population distributions, the family 
of normal curves has occupied a central position in statistical theory. Because 
of its central position, a number of i-statistics deriving from normal curve 
theory will be discussed in this chapter. 

The equation for a normal curve is 


1 — G—ur 
p= Se 207 


42 | 


x 


Fig. 12.1 A normal probability function or normal curve. It is frequently employed to 
represent a population distribution in statistical inference. (The variance is c2) 
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. where p is the relative frequency expressed as a function of X; p is the mean 
of the population; o? is the variance of the population; e is a constant value 
of approximately 2.71828; and z is a constant value of approximately 3.14159. 
A normal curve is symmetrical and unimodal. Its branches approach, but 
do not reach, the horizontal axis no matter how far they are extended. S, the 
index of skewness, is 0 and K, the index of kurtosis, is 3. An example of a 
normal curve is given in Figure 12.1. 
Imagine now that the function in Figure 12.1 is a population frequency 
| distribution. Of interest to us here is the problem of forming all possible 
random samples, of some specified size, from this normally distributed popu- 
lation and determining the distribution of all possible values of some specified 
~ t-statistic computed from each random sample. The theoretical process of 
determining the sampling distribution of a specified i-statistic is one of 
mathematical deduction beginning with the equation for a normal curve and 
ending with another equation for the sampling distribution. Some of these 
sampling distributions are primarily of theoretical interest and importance; 
| others are of great practical importance in the evaluation of research data. 


Sampling Distributions of Theoretical Interest 


THE MEAN OF A SAMPLE 


M,, the mean of a random sample of n values, is itself normally distributed 
with mean equal to jz, the population mean, and variance equal to o?/n, where 
9* is the population variance. A normal curve representing a sampling dis- 
tribution for M, is shown in Figure 12.2. That is to say, when the population 


L 
E 
Mx 


4 Fig. 122 A normal curve representing the sampling distribution of values of M; 
omputed from all possible random samples from a normally distributed population. 
€ variance is o2/n.) 
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is normally distributed, the totality of values for means of random samples of 
some fixed size is also normally distributed. 


THE VARIANCE OF A SAMPLE 


V. the variance of a random sample of n values, is seldom used as an 
i-statistic and its sampling distribution receives little attention. In place of 
V,,, the i-statistic, V ;, is employed. V; is defined as 


Dy 2 
ees 
n—]l 
Thus V; is the sample variance computed by dividing the sum of squares of 
deviations by (n — 1) instead of n. The divisor, (n — 1), is usually called the 


number of degrees of freedom. V 1 is sometimes referred to as the corrected 
value of the sample variance. The reason is that 


y n 
eem n—1 Ve 
Some additional implications of this correction will be discussed in connection 
with the estimation of population values. 
V!, the sample variance computed with degrees of freedom, has a known 
sampling distribution whose equation will not be given here. That is to say, 
when the population distribution is normal, all possible values of V; for 


+ 
0 2 
g Vx 
Fig. 12.3 The sampling distribution of V; when n = 2. (The mean of the distribution 
is o.) 
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samples of some specified size from that population form a distribution exactly 
fitted by a known mathematical function. The mean of the sampling distribu- 
tion of V; is o?, the population variance. Figure 12.3 shows a distribution of 
V; whenn — 2. It is worth noting that a closely related variable, (n — 1)V //o?, 
is distributed as chi square. That is, it has a sampling distribution specified 
by one of a family of functions called chi square functions. A particular chi 
Square distribution is identified by its mean. The mean of a chi square distri- 
bution is equal to (n — 1) the number of degrees of freedom employed in 
computing V. 


THE DIFFERENCE BETWEEN TWO SAMPLE MEANS 


Given all possible combinations of two random samples, each sample 
Consisting of n values from a normally distributed population, D», the 
difference between the means of two samples, is normally distributed. The 
mean of the sampling distribution is zero and the variance is equal to 20°/n. 
A sampling distribution of D, is displayed in Figure 12.4. 


l 


0 
Dx 


" Fig. 124 A normal curve representing the sampling distribution of Dz, the difference 
Detween means of two random samples of 7 values. The mean of the sampling distribution 


Is : A 
zero. (The variance is 202/n.) 


BETWEEN-SAMPLE AND WITHIN-SAMPLE VARIANCES 


, In Chapter 10, it was shown that the total variability of numbers classified 
to subsets could be analyzed into additive components. Sampling provides 
One basis for classification. Numbers selected from a population distribution 
can be classified according to the sample in which they occur. For example, 
"We were given c samples of n values from a normally distributed population, 
We could classify the (cn) values into c subsets or columns corresponding to 
ISG Samples. The total sum of squares for the N = cn values could then be 
nalyzed into two components: a between-column or between-sample sum of 
QUares and a within-column or within-sample sum of squares. Variances can 


a 
Si 
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be computed from these two components. V ;, the between-sample variance, 
is computed by dividing the between-sample sum of squares by a number of 
degrees of freedom equal to (c — 1). In the case of two samples, of course, 


3 di 
2 dí 
P 
8 
0 EZ 


v 


v 


20 df 


5 df 


Ó P2 


v 


Fig. 12.5 Functions representing the distributions of either Vj or Vi. Each distribution 
is identified by the number of degrees of freedom used in computing the variance. Each 
distribution has a mean equal to c?. 
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(c — 1) = 1. Vj, the combined within-sample variance, is computed by 
dividing the combined within-sample sum of squares by a number of degrees 
of freedom equal to (N — c). In the case of two samples, (N — c) — (N — 2). 

Vi, the between-sample variance, has a known sampling distribution, 
whose equation will not be given here. The mean of the sampling distribution 
is o?. Vi, the within-sample variance, also has a known sampling distribution. 
Again the mean of the sampling distribution is o°. Figure 12.5 shows functions 
which can be viewed as representing distributions of either V ; or Vw, depend- 
Ing on the number of degrees of freedom employed in computing the 
i-statistic, For example, the function labeled with two degrees of freedom 
Could be the sampling distribution of V ; for three samples or of Vj, for two 
Samples of two values each. Notice that the number of degrees of freedom 
employed in computing V; for three samples is two and that the number of 
degrees of freedom employed in computing V, for two samples of two values 
each is two, The function labeled with three degrees of freedom could be the 
sampling distribution of V | for four samples or for V, for two samples, one 
of three values, the other of two. The function labeled with five degrees of 
freedom could be the sampling distribution of V ; for six samples or of py 
for samples of three and four values. The function labeled with twenty degrees 
9f freedom could be the sampling distribution of Vj, for four samples of six 
Values each or for five samples of five values each. The student should figure 
SUE other combinations of c and n for which the sampling distributions of 
V ior Vi, would be represented by these functions. 
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THE DISTRIBUTION OF : 


There is an important ;-statistic which is designated f. A function which 
Specifies the sampling distribution of t is called a r function. There is a family 
9f t functions, one for each particular number of degrees of freedom involved 
T the computation of ż. Figure 12.6 shows t functions for 1, 2, 5, and 30 

“grees of freedom. A z function is symmetrical and unimodal. Its branches 
Maie but never reach the horizontal axis no matter how far they are 
fin ended, Its mean is zero. Its variance is //(/ — 2) if f > 2, where f is the 
is ee of degrees of freedom identifying the function. Its index of skewness 
se m Its index of kurtosis is greater than three. As the number of degrees 

reedom increases, the distribution of t approaches the normal form. 
Values Of t can be computed from a variety of other i-statistics. Three cases 


are Biven below. 


216 Sampling Distributions 


1 df 
P 
=r 
-1 O +! 
t 
2 df 
Sdf P 
P 
- M 
t 
T rr 
-1 0H 
t 
30 df 


Fig. 12.6 Functions for the sampling distribution of ¢ for 1, 2, 5, and 30 degrees of 
freedom. 
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1. For a single random sample of n values from a normally distributed 
population, 


WE ei (12.1) 
JV in 

Formula (12.1) can be interpreted as saying that the ratio on the right is 
distributed as 7. The ratio compares the difference between M,, the sample 
mean, and p, the population mean, with a value computed from Vi, the 
sample variance computed with (n — 1) degrees of freedom. The sampling 
distribution of t computed by formula (12.1) is given, in Figure 12.6, by the 
t function for one degree of freedom when n = 2, by the ¢ function for two 
degrees of freedom when n = 3, by the ż function for five degrees of freedom 
when n = 6, and by the : function for thirty degrees of freedom when n —31. 
2. For two random samples from a normally distributed population 

distribution of Y, 
D. 


= (12.2) 
t = BV 


where D, is.the difference between two sample means, V, is the combined 
Within-sample variance computed with 2(n — 1) degrees of freedom, and n is 
the number of values per sample. Formula (12.2) can be interpreted as saying 
that the ratio on the right is distributed as t. The ratio compares the difference 
between two sample means with a value computed from the combined 
within-sample variance. The number of degrees of freedom is 2(n — 1). The 
sampling distribution of t computed by formula (12.2) is given in-Figure 12.6 
by the ¢ function for two degrees of freedom when n = 2, and by the t 
function for thirty degrees of freedom when n = 16. 

3. For a single sample of paired values from a bivariate population 
distribution of Y and Y, 

Tey N — 2 


"m (12.3) 


In formula (12.3), r», is the product-moment correlation coefficient computed 
from a sample of N pairs of values. In this case, the sampling distribution of 
t contains values which could be computed from the totality of sample correla- 
tion coefficients based on all possible pairings of values of Y with a given set 
of N values of X. The deductions which establish that the ratio in formula 
(12.3) is distributed as require that the Y's for any given X are normally 
distributed and that p, the population correlation, is zero. The number of 
degrees of freedom is (M — 2). The sampling distribution of ¢ computed by 
formula (12.3) is given in Fig. 12.6 by the ¢ function for two degrees of 
freedom when N — 4, and by the ¢ function for thirty degrees of freedom 
when N = 32. 
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Another important i-statistic, designated F, is the variance ratio. A 
function which describes the sampling distribution of Fis called an F function. 
There is a family of F functions. A particular curve from the family is 
identifiedby a combination of degrees of freedom. The combination consists 
of the two numbers of degrees of freedom which enter into the computation of 
the two variances in the ratio. Figure 12.7 contains six F functions. Each of 
the six functions is labeled with two numbers giving the combination of 
degrees of freedom for that function. 


i'and: dF 1 and 30 df 


[es 
7 
O1 


2 and 4 df 
2 and 20 df 


| 
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5 and 5 df 20 and 20 df 


F i F 


Fig. 12.7 Functions for the sampling distribution of F for six combinations of degrees 
of freedom. 


Fis a ratio involving two variances. The ratio can be computed in a variety 
of ways, three of which are listed below. 

1. For c > 2 random samples from a normal population distribution, 
V 1 
Fe v: A (12.4) 
Let n = the number of values per sample; let N = cn. V, is the between- 
sample variance computed by dividing the between-sample sum of squares 
by (c — 1) degrees of freedom. V, is the combined within-sample variance 
computed by dividing the combined within-sample sum of squares by (N — c) 
degrees of freedom. F is the ratio of the between-sample variance to the 
within-sample variance. The sampling distribution of F computed by formula 
(12.4) is given in Figure 12.7 by the F function for one and two degrees of 
freedom when c = 2 and n = 2, and by the F function for one and thirty 
degrees of freedom when c = 2 and n = 16. It is interesting to note that, when 
€ — 2, F computed by formula (12.4) is equal to ¢? when t is computed by 
formula (12.2). 

2. For two random samples from a normally distributed population, 


t 
an, (12.5) 


~ E the number of values ina first sample; let 5 = the number of values 
"isis sona sample. Again F is the ratio of two variances. In the numerator 
iiia riance for a first sample computed by dividing the sample sum of 

es by (n, — 1) degrees of freedom. In the denominator is the variance 
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for a second sample computed by dividing the sample sum of squares by 
(nz — 1) degrees of freedom. The sampling distribution of F computed by 
formula (12.5) is given in Figure 12.7 by the F function for two and four 
degrees of freedom when n, = 3 and n = 5, by the F function for five and 
five degrees of freedom when n; = n; = 6, and by the F function for twenty 
and twenty degrees of freedom when n, = n, = 21. 

3. For a single sample of paired values from a bivariate population 
distribution of X and Y, 
rauN — 2), (12.6) 


eae ir 


which is equivalent to a ratio involving two variances, 
Vy 
Vi-w 

where V;, is the variance of predicted values and Vy is the variance of 
errors of prediction. Vj, is computed by dividing Sy", the sum of squares 
of predicted values, by one degree of freedom. Viy is computed by dividing 
Ay — y» , the sum of squares of errors of prediction, by (N — 2) degrees of 
freedom. In formula (12.6), r+, is the product-moment correlation coefficient 
computed from a sample of N pairs of values. In this case, the sampling 
distribution of F contains values which could be computed from the totality 
of sample correlation coefficients resulting from all possible pairings of 
values of Y with a given set of values of X. The deductions which establish 
that the ratio in formula (12.6) is distributed as F require that the Y’s for any 
given X are normally distributed and that p, the population correlation, is 
zero. The student should compare formula (12.3) and formula (12.6). F com- 
puted by formula (12.6) is equal to /? when ¢ is computed by formula (12.3). 
The numbers of degrees of freedom for F are 1 and (N — 2). The sampling 
distribution of F computed by formula (12.6) is given in Figure 12.7 by the 
F function for one and two degrees of freedom when N — 4, and by the F 
function for one and thirty degrees of freedom: when N — 32. 


E 


THE DISTRIBUTION OF 6$ 


The standard normal deviate, an i-statistic which we shall designate as 8, 
will be discussed next. The standard normal deviate, 8, is a normally dis- 
tributed variable. Its distribution has a mean of zero, a variance equal to one, 
an index of skewness equal to zero, and an index of kurtosis equal to three. 

Only one way of computing ô will be given here and that way is only 
approximate, not exact. For two large, random samples from a bivariate 
population distribution, the ratio 

Z-ZA 
JBKN — 3) 
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is distributed approximately as the standard normal deviate. Therefore, we 
write 

Zi = Zi 
- ABKN — 3) 


understanding that the equality holds only approximately. Z; is computed 
from the product-moment correlation coefficient for the first sample, as 
follows: 


(12.7) 


Z, is computed in the same way from the product-moment correlation 
coefficient for the second sample. N is the number of pairs of values in each 
sample. Figure 12.8 shows a normal curve with zero mean and unit variance, 


0 
ô 


Fig. 12.8 The sampling distribution of the standard normal deviate, 5. The function is a 
normal probability function or normal curve with zero mean and unit variance. 


the function for the sampling distribution of 6. Notice that formula (12.7) 
involves the difference between Z; and Z;. Since Z; is a function of the 
Magnitude of one sample correlation coefficient and Z; is a function of the 
magnitude of the second sample correlation coefficient, ô indirectly compares 
the two sample correlation coefficients. In deducing the sampling distribution 
of 8, there is no requirement that p, the population correlation coefficient, be 
Zero, as there was for t by formula (12.3) and F by formula (12.6). 


The Tabled Functions 


The distribution functions of t, F, and 8 have been studied and information 
about them has been tabulated. In general terms, the tabulated information 
Or a particular function consists of one or more values of the /-statistic, each 
Of which is exceeded by a known proportion of all values in the sampling 
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distribution. We shall comment on the tabled functions in the order: t, F, 
and à. 

Table A in the Appendix presents information for 34 different t functions, 
each one identified by a number of degrees of freedom. For each of these 
t functions, a value of ! is given. Let us call this particular value f,. It has two 
interpretations: one as a positive value, the other as a negative value. Inter- 
preted as a positive value it is known to be exceeded by 0.025 of the totality 
of t values in that sampling distribution. Interpreted as a negative value, it is 
known to exceed 0.025 of the totality of f values in the distribution. Thus the 
proportion of values of t exceeding the positive value of t, plus the proportion 
of values exceeded by the negative value of t, is equal to 0.05. 

Examination of Table A reveals that, for two degrees of freedom, a value 
of t, equal to +4.303 is exceeded by 0.025 of all values of t in that distribution 
and a value of t, equal to —4.303 exceeds 0.025 of all values in the distribu- 
tion. For thirty degrees of freedom, a value of t, equal to 4-2.042 is exceeded 
by 0.025 of all values of t and a value of t, equal to —2.042 exceeds the same 
proportion of values. 

Interpreted with respect to the sampling distribution of t, the tabled values 
of t, provide a basis for classifying all values of t in the distribution. We shall 
return later to this notion of classifying values of an i-statistic. It has great 
importance in the pragmatics of statistical inference. 

Interpreted graphically, the tabled values locate points on the scale of t 
which define areas under the curve. Consider the total area enclosed between 
the curve and the horizontal axis. If we erect perpendiculars to the horizontal 


30 df 


0 
t 


Fig. 12.9 Two / functions for two and thirty degrees of freedom. Perpendiculars have 
been erected at points defined by positive and negative values of /,. The shaded area in 
each tail constitutes 0.025 of the total area under the curve. 
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axis at the points defined by positive and negative values of te, the area to the 
right of the positive value is 0.025 of the total area under the curve and the 
area to the left of the negative value is 0.025 of the total area. Together these 
two tail areas make up 0.05 of the total area. Figure 12.9 shows the / functions 
for two and thirty degrees of freedom. Two perpendiculars have been erected 
for each curve. They were erected at +4.303 and —4.303 on the axis of the 
function for two degrees of freedom, and at +2.042 and —2.042 on the axis 
of the function for thirty degrees of freedom. The tail areas have been shaded. 

Table B, in the Appendix gives information about 168 different F functions, 
each one identified by two numbers of degrees of freedom. For each of these 
F functions, a value of F is given. Let us call this tabled value F.. It is inter- 
preted only as a positive value. It is known to be exceeded by 0.05 of the 
totality of F values in the sampling distribution. 

Examination of Table B, reveals that, for one degree of freedom in the 
numerator and four degrees of freedom in the denominator, a value of Fea 
equal to 7.71 is exceeded by 0.05 of all values of F in that distribution. For 
five degrees of freedom in the numerator and five degrees of freedom in the 
denominator, a value of F, equal to 5.05 is exceeded by 0.05 of the values of 
F in the sampling distribution. (Table B, gives the value of F, exceeded by 
0.025 of values in the distribution.) 

_ The tabled values of F, provide a basis for classifying the totality of values 
in each sampling distribution. We shall have more to say about this classifying 
in a later discussion of pragmatics. Interpreted graphically, the tabled values 


F 


" Fig. 12.10 An F function for five and five degrees of freedom. A perpendicular has been 
ected at a point on the horizontal axis such that the shaded tail area is 0.05 of the total 
area under the curve, 
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locate points on the scale of F. At each point a perpendicular can be erected 
to define areas under the curve. Figure 12.10 shows an F function for five and 
five degrees of freedom. A perpendicular has been erected at the point on the 
horizontal axis, 5.05. The shaded tail area is 0.05 of the total area under the 
curve. 

We have pointed out that there is a family of ¢ functions and a family of 
F functions. Table A provides information about 34 ¢ functions selected from 
the entire family; Table B, provides information about 168 F functions 
selected from the family; Table B, gives information about 100 F functions. 
When the number of degrees of freedom is indefinitely large, t is equal to ô, 
the standard normal deviate. Therefore, the value of ô, can be read from the 
last line of Table A. The value of 8, has two interpretations: one as a positive 
value, the other as a negative value. Interpreted positively, the value of 8, 
exceeded by 0.025 of the values in the sampling distribution is 1.96. Inter- 
preted negatively, the value of 8, which exceeds 0.025 of the values in the 
sampling distribution is —1.96. Figure 12.11 contains the normal curve in 


[°] 
6 
Fig. 12.11 A normal curve with zero mean and unit variance representing the sampling 


distribution of ô. Perpendiculars have been erected at ô = 1.96 and ô = —1.96. Each 
shaded area is 0.025 of the total area under the curve. 


standard form representing the sampling distribution of 8. Perpendiculars 
have been erected at 8, — 1.96 and 8, — — 1.96. Each shaded area constitutes 
0.025 of the total area under the curve. 

Some statistical texts present very extensive tables of the standard normal 
deviate, but not for the same purposes served by the tables of t and F. Since 
our interest in ô is the same as our interest in t and F, we shall give only the 
information which is necessary for classifying values of 8, as indicated above. 

Tables of t and F in the Appendix give values of the i-statistics associated 
with the relative frequency 0.05. In the distribution of 1, this relative frequency 
of 0.05 consists of the relative frequency 0.025 for extreme negative values 
and 0.025 for extreme positive values. In the distribution of F, the relative 
frequency 0.05 stands for extreme values in the positive direction only. Tables 
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of t and F are available, which are more extensive than those given in the 
Appendix. These tables can be more extensive in two ways. They may exhibit 
greater variety in the numbers and combinations of degrees of freedom 
employed. They may present relative frequencies other than and in addition 
to 0.05. Values of an i-statistic are sometimes given for relative frequencies 
such as 0.10, 0.02, and 0.01 in addition to the values for the relative frequency 
0.05. The limited tables presented in the Appendix have been found to be 
quite satisfactory for instructional purposes. The research psychologist would 
do well to have available the most complete tables. Two sources containing 
extensive tables are: R. A. Fisher, and F. Yates, Statistical Tables for Bio- 
logical, Medical, and Agricultural Research (London and Edinburgh: Oliver 
and Boyd, 1948); and G. W. Snedecor, Statistical Methods (Ames, lowa: 
Iowa State College Press, 1946). 


From Theory to Practice 


Up to this point, the discussion has dealt only with the syntactical features 
of sampling error. The problem of realizing these features in practice is a 
Problem of semantics, which remains to be considered. Practical random 
Sampling, as distinguished from theoretical random sampling, will be 
considered in Chapter 13, which follows. 


PRACTICAL RANDOM SAMPLING 


Semantics 15 


There are two major semantical problems in psychological statistics. One 
of these, the problem of confirmation in psychological measurement, was 
discussed in Chapter 5. The other problem has to do with confirmation of the 
correspondence between random sampling theory and random sampling 
practice. The problem of confirmation in sampling theory and practice is the 
source of much misunderstanding, confusion, and incorrect usage in psycho- 
logical research. 

There are two principal ways in which the researcher can go wrong with 
respect to the problem of correspondence between sampling theory and 
practice. First, he may fail to recognize that the problem has to do not only 
with what he thinks, but also with what he does. For him, correspondence 
between theory and practice, or the lack of correspondence, is not a reality. 
He may look upon correspondence as nothing more than a conceptualization. 
He may adopt random sampling theory as no more than a sophisticated view- 
point, currently in vogue in the best research circles. If he does accept the 
reality of correspondence, he may err in thinking that its occurrence is 
fortuitous. He does not fully appreciate the fact that, within certain limits, 
he can actually make practice correspond to theory, and that his failure to do 
so leaves his work vulnerable to criticism and subject to rejection. He sees no 
connection between his choice of research operations and the statistical theory 
on which he will base his evaluation. He may advocate planning in advance 
so that a statistical evaluation is possible, but the demands of such an 
evaluation are not allowed to determine or limit the plans. If he worries at 
all about correspondence between theory and practice, he may attempt to 
rationalize his position or try to inveigle some person of statistical authority 
into doing so for him. 

The person who fails to recognize the nature of the problem becomes 
undiscriminating in evaluating as well as in planning. He fails to coordinate 
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his choice of a method of analysis with the character of his data. Critical 
features of his research operations may vary from one project to another, but 
this variation is not reflected in the kind of analysis he employs. The choice 
between description and inference is not determined by the nature of the 
operations which produce his data. It is often determined by nothing more 
than a preference for a sophisticated method over an unsophisticated one. 

With respect to the correspondence between theory and practice, there are 
no standards, in an absolute sense, but there are standards in a relative sense. 
That is to say, correspondence is obviously better for some research opera- 
tions and some evaluations than for others. It is sometimes possible to be 
discriminating in the choice of operations and thereby improve corres- 
pondence, When that is not possible, one can still be discriminating in his 
Choice of a method of analysis. The discriminating researcher recognizes 
Variation in the quality of research operations and is prepared to choose a 
type of evaluation which reflects this judgment and which is not likely to 
mislead his audience. Description and inference are the two kinds of analysis, 
between which a researcher can choose and thereby coordinate method of 
analysis and quality of operation. 

The second way in which the researcher can go wrong relates to his 
misunderstanding of the consequences of various kinds of failure in corres- 
Pondence between theory and practice. There is no way of assessing over-all 
Correspondence, Practice may correspond to theory in one respect but not 
another. Furthermore, the degree of correspondence may vary. There is 
available substantial evidence as to the consequences of certain kinds and 
degrees of failure in correspondence. The consequences of one kind are known 
to be negligible. The consequences of another may be disastrous. All of this 
means that the psychologist must put the issues in a proper order; that is, he 
Must put the more serious issues ahead of the less serious ones. He should, 
of Course, try to satisfy all requirements perfectly, but if he falls short of the 
ideal, he should not become preoccupied with a trivial failure and ignore a 
Serious one, 


Dimensions of Correspondence 


We said above that there is no way of assessing over-all correspondence 
between random sampling theory and practice. Theoretical random sampling, 
as it was presented in Chapter 11, involved a known population distribution 
and a prescribed deductive system for obtaining the sampling distribution of 
Some i-statistic. In Chapter 12, the population distribution was specified as a 
pu vans and the deduced sampling distributions were the f function, the 

unction, and the normal curve representing the standard normal deviate. 
(See formulas 12.1 to 12.7.) For convenience in discussion, let us divide the 
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problem of correspondence into two parts: the characteristics of the popula- 
tion distribution and the system of obtaining samples. 

A normal curve is a continuous function, unimodal and symmetrical. (See 
Figure 12.1.) Its index of skewness is zero. Its index of kurtosis is three. 
Suppose the population distribution is not a normal curve. Suppose there is, 
as we say, some departure from normality. What are the consequences? Are 
they serious? 

Psychological measurements are always discrete variables, not continuous 
ones. The degree of discontinuity in the population may be great, as when the 
number of score intervals is two, or it may be small, as when the number 
of score intervals is one hundred. Whether the population discontinuity is 
great or small, the effect on a sampling distribution depends on the size of 
the sample. For any given number of score intervals, as sample size increases, 
the discontinuity of a sampling distribution diminishes. Furthermore, the 
effect will vary with the particular i-statistic under consideration. There is, 
of course, no absolute standard for deciding whether an effect is negligible or 
serious. Having surveyed the available evidence, we take the position here 
that the effect of discontinuity is negligible (1) when the number of values per 
sample is equal to or greater than twenty and the number of score intervals 
is equal to or greater than five; and (2) when the number of values per sample 
is equal to or greater than five and the number of score intervals is equal to 
or greater than twenty. The choice of these values is, to some extent, arbitrary. 
They are given simply as guides for the student who might be expected to 
form his own standards eventually after he accumulates more experience. 
There is evidence as to the specific effects of discontinuity on the sampling 
distributions of a variety of i-statistics, but presentation of that evidence 
would take us beyond the reasonable scope of this discussion. 

Departures from normality with respect to the form or shape of the 
population distribution are usually given in terms of skewness and kurtosis. 
There is no question that the distributions of some psychological measures for 
some large groups of subjects do display nonnormal characteristics. Many 
distributions encountered in practice exhibit a slight to moderate degree of 
skewness and a slight to moderate degree of flatness. Only rarely does one 
encounter a so-called J-shaped or U-shaped distribution. The departure from 
normality in terms of skewness and kurtosis does affect the sampling distribu- 
tion of those i-statistics presented in Chapter 12. Again this effect depends 
on the size of the sample. As sample size increases the effect becomes smaller. 
Furthermore, the effect of skewness and kurtosis varies depending on the 
i-statistic in question. There is no absolute standard for deciding how serious 
the effects are. After a considerable amount of independent research on this 
issue and an extensive study of the available evidence as to the effects of 
skewness and kurtosis, we have taken a position which can be summarized, 
as follows: 
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1. For the slight to moderate departures from normality ordinarily en- 
countered in practice, the effect on the sampling distributions of t and F, the 
multiple-sample i-statistics which compare sample means through formulas 
(12.2) and (12.4), is negligible when there are ten or more values per sample. 

2. For the departures from normality encountered in practice, the effect 
on the sampling distribution of t, the single-sample i-statistic computed by 
formula (12.1), is more serious than the effect on the multiple-sample t and 
F, but is negligible when there are thirty or more values in the sample. 

3. The effect of moderate departures from normality on the sampling 
distribution of F, the ratio of two sample variances, as given by formula 
(12.5), is serious for very small samples, but is negligible for samples of fifty 
values or more. 

4. The effect of moderate degrees of nonnormality on the sampling 
distribution of t, as in formula (12.3), or of F, as in formula (12.6), computed 
from a sample, product-moment coefficient of correlation, is negligible when 
the sample contains thirty or more values. 

The arbitrary standards stated above obviously do not cover all situations. 
There is a vast amount of rational and empirical evidence concerning the 
effects of skewness and kurtosis on various sampling distributions, but we 
cannot do justice to the evidence here. While the standards suggested above 
are necessarily somewhat arbitrary, they have been adopted only after very 
Serious study of the available evidence. There are some conspicuous gaps in 
the evidence on the effects of nonnormality. For example,we have not found 
evidence concerning the effect on the sampling distribution of 8, as computed 
from two sample correlation coefficients by formula (12.7). 

When we say that the sampling distribution of a given i-statistic.is affected 
by nonnormality, we mean that the shaded areas in curves such as those of 
Figures 12.9, 12.10, and 12.11 may, under conditions of nonnormality in the 
Population distribution, not be exactly what we expect them to be under the 
ideal, theoretical conditions represented by Tables A and B. When we say 
that the effects are negligible, we mean that the shaded areas, as a proportion 
of the total area, may deviate from 0.05, but will lie between 0.04 and 0.06. 
Again the student must be cautioned not to interpret these values as absolutes. 
In our judgment, the effect would be serious if the shaded areas constituted 
as much as 0.10 of the total area when the ideal, theoretical value was 0.05. 

As it was described in Chapter 11, theoretical random sampling involved 
determining, by mathematical deduction, the nature of the sampling distribu- 
tion of a given i-statistic. Notice that a random sample was defined as any 
logically possible set of n values. It was not defined as a set of numbers con- 
Sisting of typical values or as any particular combination of values. The 
Sampling distribution of some given /-statistic is itself a frequency distribution 
Of the values of that i-statistic computed for each one of all possible, logically 

determined samples. What, then, is random sampling in practice? Is there 
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some set of concrete operations which deserve to be labeled as random 
sampling procedures and distinguished from other operations? 

At the risk of seeming dogmatic, we shall define practical random sampling 
as selection by means of a tested mechanical system. The system involves 
concrete operations which are instituted and managed by the investigator. 
Before the system is ever used in research, it is tested to determine whether or 
not it will produce empirically, for a given i-statistic, a sampling distribution 
which corresponds to the mathematically deduced, theoretical sampling 
distribution. If there is substantial evidence of correspondence between the 
empirical sampling distribution and the theoretical sampling distribution, the 
mechanical system passes the test. 

In real sampling, as distinguished from theoretical sampling, the selections 
are made from a population consisting of a large but finite number of existent 
and accessible units which can be measured. In psychological research, the 
unit of the population is a person, an animal, a behavioral event, or a 
behavioral product. The number of units selected is usually small relative to 
the number in the population. The population is real. That is, it exists and 
the units of the population are all equally accessible to the investigator doing 
the selecting. 


The Operations of Real Sampling 


The mechanical system most widely used for selecting units from a popula- 
tion is one whose central feature is a so-called table of random numbers. ^ 
table of random numbers is a collection of digits with values from 0 to 9; 
The collection is usually generated, initially, by some mechanical device, such 
as a lottery, a die, or a roulette wheel, or by similar electronic equipment. The 
table is tested to determine whether it can be used to generate one or more 
appropriate sampling distributions. If repeated use of the table does produce 
an approximation to the desired distribution, then use of the table for 
sampling purposes is considered acceptable. That is, the correspondence 
between theoretical sampling and practical sampling is taken to be confirmed. 

Broadly viewed, the system of real sampling has three phases: (1) units in 
the population are enumerated; (2) numbers are read from a table of random 
digits; and (3) units in the population are identified. The specific steps in 
these three phases are given below. 

1. Enumeration. Assign ordinal numbers to the units of the population in 
any arbitrary fashion. For example, if the population consisted of 9,253 
students listed in a university directory, each of the ordinal numbers from 
0000 to 9,252 could be written beside the name of a student. Since the assign- 
ment may be arbitrary, the simplest procedure would be the best. Thus one 
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could begin at the beginning of the alphabetized directory and assign 
successive numbers to the names in alphabetical order. 

2. Reading. Decide in advance the direction in which you will read numbers 
from the table of random digits and what you will do at the end of an array, 
at the end of a page, and at the end of the table. Choose a starting point in 
the table by some mechanical means. For example, a die could be used. The 
die could be thrown several times to indicate the starting page, row, and 
column. Next read groups of digits of a size appropriate to your needs. For 
example, if the population consists of 9,253 students enumerated from 0000 
to 9,252, then you should read groups of four digits. It is usually advisable 
to record these groups in the order of their occurrence in the table. Read and 
record as many groups of digits as there are to be subjects in the sample. If 
the sample is to consist of 20 subjects, then it is necessary to read and record 
20 groups of four digits from the table of random numbers. Numbers which 
recur and numbers which do not appear in the enumeration can be ignored. 

3. Identification and selection. Take each group of digits and identify the 
unit bearing the same number in the enumerated population. Referring again 
to the population of 9,253 students, if one group of digits is 0389, look for 
the student whose name appears opposite that same ordinal number in the 
enumeration. He would be the 390!^ in the listing of the directory. Thus each 
group of digits identifies a subject for the sample. 


A DEMONSTRATION OF REAL SAMPLING 


Because of the importance of this system for obtaining samples and 
because students often have difficulty finding an account of the system, we 
Shall go to some lengths to describe and discuss a demonstration of it. 

We might demonstrate real sampling in any one of three ways: 

l. Using a large group of real subjects as the population. . . 

Example: Let the population consist of all male undergraduates listed in 
the student directory of University M. These students can be enumerated by 
Writing ordinal numbers opposite their names. A sample of n names can be 
obtained by means of a table of random digits. Students whose names are 
Selected are measured on a response variable, X. An i-statistic is computed 
from the sample of real measures. 

If it is desired, the sampling can be repeated many times. Each time, the 
Value of the specified i-statistic can be computed. These values of the 
Statistic can be arranged in a frequency distribution. This frequency 
distribution is actually an empirical sampling distribution for the specified 
t-statistic, 

2. Using a frequency distribution of real measures as the population. 

Example; Let the population consist of scores obtained from the files of 

!* admissions office for students admitted to University M in September 
- Ordinal numbers are assigned to these scores. A sample of n scores is 
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obtained by means ofa table of random digits. An i-statistic is computed from 
the sample of n scores. 

If it was desired, the sampling could be repeated and an empirical sampling 
distribution for the i-statistic could be established. 

3. Using an artificial frequency distribution of values as the population. 
The student should understand that we are concerned at this point with the 
possibility of demonstrating real sampling. It is entirely possible to do real 
sampling from an artificial population. Of course, sampling from an artificial 
population is only of interest as a demonstration, whereas sampling from a 
large group of real students might be an integral part of a serious research 
design. As a matter of convenience in providing a demonstration of real 
sampling in this text, we shall employ an artificial frequency distribution of 
values as the population. 

Example: An artificial population distribution of 10,000 values is given in 
Table 13.1. The value of the variable X is given in the first column. Score 


Table 13.1 
AN ARTIFICIAL POPULATION OF 10,000 VALUES 


X T Enumeration® 
9 30 0000-0029 
8 150 0030-0179 
7 630 0180-0809 
6 1,610 0810-2,419 
5 2,580 2,420-4,999 
4 2,580 5,000-7,579 
3 1,610 7,580-9,189 
2 630 9,190-9,819 
l 150 9,820-9,969 
0 30 9,970-9,999 


N = Zf = 10,000. 
“Ordinal numbers have been assigned to the values in each score interval. 


intervals run from 0 to 9. The frequency for each of the 10 score intervals is 
given in the second column. The enumeration of these 10,000 values of X has 
been carried out as indicated in the third column. Ordinal numbers from 0000 
to 9,999 have been assigned to the values. Notice that the 30 values of X — 9 
have been numbered from 0000 to 0029, that the 150 values of X — 8 have 
been numbered from 0030 to 0179, and so on. 

To guide the selection of a sample from the artificial population distribution 
of Table 13.1, we need a table of random digits. One of the most extensive 
tables presently available has been published by the RAND Corporation (1). 
We recommend it to the student. It consists of 400 pages, each containing 
2,500 digits arranged in 50 rows and 50 columns. 

Before we choose a starting point in the RAND table, we must agree on 
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à direction for reading. There are many legitimate ways of reading. On this 
occasion, we choose to read down a column. When we reach the bottom of 
à column, we shall go to the top of the next column to the right and again 
read down. When we reach the end of a page, we shall go to the top of the 
first column of the next page. If we should reach the end of the table, we shall 
o to the top of the first column of the first page. 


04053 17715 
31727 22402 
48972 14109 
35099 31960 
20483 88049 


68653 97277 
27371 05095 
41026 60713 
06875 44626 
64218 57810 


52402 60583 
68304 38817 
23596 97483 
21925 00748 
32992 15761 


43400 
11241 
04845 
51506 
17811 


62431 
87439 
56004 
99143 
15692 


71761 
19251 
30567 
67909 
19539 


44330 
06733 
81938 
85563 
42937 


11279 
27400 
24130 
74420 
64062 


81185 
15002 
72830 
98377 
83753 


34522 
61640 
34052 
59727 
70031 


72251 
71329 
82914 
25641 
15082 


33176 
13560 
48193 
76732 
37828 


47023 
98748 
86723 
42090 
34254 


04679 
21566 
41836 
83185 
05593 


97328 
92991 
98733 
28108 
27700 


Table 13.2 
PAGE 315 OF THE RAND TABLE OF RANDOM DIGITS« 


68089 
02827 
01783 
29550 
02221 


36814 
85939 
00715 
71349 
68878 


75982 
11777 
72614 
28409 
11066 


30645 
25280 
26893 
32553 
51279 


55813 
98177 
54027 
82692 
00297 


27471 
95177 
74928 
75949 
41183 


30355 
50795 
00579 
88060 
30110 


43076 
64115 
83161 
14100 
08555 


16737 
05110 
02713 
60393 
07225 


72908 
27477 
36819 
61305 
72123 


59654 
23154 
38882 
73823 
98838 


78030 
36298 
01880 
73331 
93953 


20322 
70134 
14707 
21648 
77697 


68312 
81106 
76961 
49379 
01203 


28581 
87031 
63404 
88008 
00852 


27689 
27120 
38430 
35799 
91317 


90215 
81545 
49655 
88512 
58142 


72060 
16537 
99084 
34178 
62126 


52333 
16582 
56638 
23421 
63966 


89337 
90143 
58121 
13105 
61016 


99542 
23711 
56120 
24908 
57776 


81053 
59275 
68672 
06371 
44412 


45054 
43045 
14151 
65023 
72612 


02589 
88473 
20028 
46882 
78326 


76076 
69014 
59995 
43386 
94679 


68758 
48199 
41355 
36046 
74250 


06190 
79992 
92247 
75505 
71082 


34433 
50174 
86559 
90528 
12396 


10847 
27143 
93316 
31227 
53560 


46817 
64426 
46629 
22794 
83809 


"Lines of the original RAND table are numbered. These numbers have been 
from the table as it is reproduced above. Adapted with permission from the RAND 
Corporation, A Million Random Digits (Glencoe, Ill.: The Free Press, 1955), p. 315. 


91727 
55896 
56408 
41416 
10815 


54435 
60039 
06459 
28527 
64415 


33634 
13079 
65664 
25513 
43426 


23774 
10209 
73539 
03077 
07558 


62762 
44262 
92291 
68763 
53124 


24717 
50984 
62639 
30529 
69886 


omitted 
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Table 13.2 
PAGE 315 OF THE RAND TABLE OF RANDOM DIGITS (continued) 


25618 40076 81223 34072 58189 09393 38528 09972 55984 77363 
64302 92307 15246 15004 45925 65448 61462 38861 18612 90501 
26803 58650 80410 04526 99352 25352 56848 25675 88086 89600 
86214 24449 37100 30921 16325 65987 07534 24678 23150 20902 
10817 66999 12056 22844 03847 33336 52092 24251 49702 24384 


22677 51371 47284 22063 07727 31379 90462 65657 41798 00212 
19193 79282 30388 60077 25370 03559 03151 61731 42763 24632 
19052 65518 36005 78326 81155 97428 98679 15903 34641 51083 
03231 80995 66618 13905 90549 29499 88947 66575 02525 08398 
88063 16412 99869 38663 82026 06803 18966 78992 21280 29960 


26529 35993 24584 00452 11108 75438 06078 98785 38040 93746 
84241 57374 39160 27898 27412 35091 06716 78165 24196 12540 
30581 12894 02779 03668 48223 88050 00591 07738 87437 60512 
13719 64755 29962 32118 18738 30514 33012 89132 84340 97263 
14983 13884 48457 81853 74278 33136 20349 71379 97447 42103 


47762 02694 52343 62711 09088 35045 75288 00758 76053 60504 
26793 37177 06115 74306 79880 93933 64200 06100 40161 08666 
13144 61888 93496 34850 97094 50528 26382 64887 71587 21951 
93754 22003 38235 87426 44051 03441 65669 16972 30122 16988 
77437 95511 58403 56003 22978 30781 72495 67801 67423 20297 


We now proceed with the selection of a starting point. There are 400 pages 
of random digits in the RAND table. We divide the 400 pages into five 
sections of 80 pages each, and throw a die to choose a section. We throw a 
" 4," thereby choosing the fourth section of 80 pages from page 241 to 
page 320. (If we had obtained a ** 6 " on the first throw, we would have ignored 
it and thrown again.) We divide the 80 pages into five sections of 16 each and 
throw the die. This time we obtain a '* 5," thereby choosing pages 305 to 320. 
We divide these 16 pages into four sections and throw the die. A “ 3" comes 
up and our choice has been narrowed to the four pages: 313, 314, 315, and 
316. Another throw gives us a “ 6 " which we ignore. Another throw yields 
“ 3" and page 315 is chosen for the starting point. 

On page 315 of the RAND table there are 2,500 digits arranged in 50 rows 
and 50 columns. This arrangement is reproduced in our Table 13.2. Notice 
the grouping of rows and columns produced by the spacing. This grouping 
only serves the purpose of making the table easier to read. We shall take 
advantage of the grouping to locate our starting point. Five major columns 
can be distinguished. A throw of the die produces a ** 1 " and puts us in the 
first major column of the page. The 10 major rows can be divided into five 
pairs. A throw of the die yields a “ 3 ’’ and puts us in the middle pair of the 
five pairs of major rows. Our choice has now been narrowed down to the 
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arrangement of digits set apart by the rectangle of dotted lines in Table 13.2. 
There are two major columns in this arrangement. Let the left column be 
designated “ odd " and the right column, “ even." A throw of the die yields 
a "2" and we choose the right column. There are two major rows in the 
column. Let the upper row be “ odd " and the lower row be “ even." The die 
says “ 3" and we choose the upper row. There remains now a block of digits 
with five rows and five columns. A throw of the die for the column gives a 
" 3." Another throw for the row gives a “ 2 " and the starting point is finally 
determined. The digit which serves as a starting point is a “ 7 " in the twenty- 
second row and the eighth column of the 50 x 50 arrangement on page 315 
of the RAND table. 

Let us suppose that we want a sample of 10 values from the population of 
Table 13.1. We require 10 groups of four digits from the table of random 
digits, starting with the “ 7." Reading down the column yields the following 
10 groups: 7502, 2733, 8036, 4932, 5949, 3878, 6180, 5100, 6479, and 1218. 

By means of these 10 numbers, we can now identify 10 values of X in the 
population distribution of Table 13.1. The first number read from the table 
of random digits is 7502. In the distribution of Table 13.1, the value of X 
numbered 7,502 is 4. The second number is 2733. The value of X numbered 
2,733 is 5. Thus we can determine 10 values of X to which we were directed 
by the 10 groups of digits read from the table of random digits. Table 13.3 


Table 13.3 


A RANDOM SAMPLE OF TEN VALUES 
FROM THE POPULATION IN TABLE 13.1 


Random Corresponding 

Numbers Values of X 
7502 
2733 
8036 
4932 
5949 
3878 
6180 
5100 
6479 
1218 


Ov RR UA dA CA Q2 CA 8 


8lves the 10 groups of four digits from the table of random digits and the 

Corresponding values of X from the population in Table 13.1. The 10 values 

of X constitute a real sample from the artificial population. 

mum ds obtained a random sample of 10 values in this demonstration, we 

di compute three i-statistics from those values. Among the i-statistics 
Scussed in Chapter 12, there are three which can be computed from a single 
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sample of values of X. The three possibilities include M., the sample mean; 
Vi, the sample variance computed with degrees of freedom; and f, a ratio 
computed by formula (12.1). To complete the demonstration, we shall com- 
pute f. In the process of computing t, we shall have to compute the other two: 
M, and V}. Formula (12.1) is 

Mz — pF. 


AV iin 


The required values can be computed, as follows: 


_ EX 44 


M, n = T = 44 
EX» 2 
ox? = LX? a E 200 - 6.4; 
n 10 
Xx 6.4 
y; = = 2f = 0711; and 
H—1 9 
Bo =45 


Substituting these values in formula (12.1) yields the following result: 


M.—pn 44—45 _ 


P _ “= —0315. 
JVin JOTO 


The value of ¢ computed for a random sample from the population in Table 
13.1 is —0.375. This completes the demonstration of the procedure for 
obtaining a single random sample. 

If we wished to generate an empirical sampling distribution of /, we would 
repeat the selection of a sample many times and compute for each sample a 
value of rt. The frequency distribution of a very large number of values of t, 
so obtained, would constitute an empirical sampling distribution, the counter- 
part of a theoretical t function. We shall not attempt here to generate a com- 
plete sampling distribution but, to illustrate the results of repeated sampling 
on a limited scale, we have selected four additional samples of 10 values from 
the population in Table 13.1. Table 13.4 shows the numbers read from page 
315 of the RAND table and the corresponding values of X from the popula- 
tion. The student can verify that the random numbers for the four additional 
samples follow those of the first sample in the RAND table. Table 13.5 con- 
tains values of M,, V}, and t for each of the five samples. Figure 13.1 shows 
a frequency distribution of the five values of t obtained from the five random 
samples. The student should verify the computation of the f values from the 
four additional samples. 

It is important to understand that each of the five sets of values discussed 
above qualify as random samples because of the method used in selecting the 
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values and not because of any characteristics of the values themselves. Like- 
wise, the variability of the ¢ values in Figure 13.1 is random variability because 
of the method used in selecting and not because of any characteristics of the 
Table 13.4 
FOUR ADDITIONAL RANDOM SAMPLES^ 


Sample 2 Sample 3 

Random Corresponding Random Corresponding 
Numbers Values of X Numbers Values of X 

1846 6 2909 5 

3438 5 7536 4 

6895 4 0373 7 

5488 4 8143 3 

9937 1 2339 6 

0549 7 6795 4 

7819 3 8496 3 

1979 6 7670 3 

5897 4 9912 1 

8015 3 8523 3 

Sample 4 Sample 5 

Random Corresponding Random Corresponding 
Numbers Values of X Numbers Values of X 

4454 5 1831 6 

4783 5 4336 5 

1410 6 9230 Z 

5168 4 2450 5 

5917 4 9353 2 

1361 6 1417 6 

4088 5 2769 5 

4122 5 5190 4 

7681 3 7946 3 

7988 3 1521 6 


"Population in Table 13.1. 


Table 13.5 
VALUES OF Mx, Vi, AND t FOR THE FIVE RANDOM SAMPLES? 


Sample 
1 2 3 4 5 
M; 4.4 4.3 3.9 4.6 44 
Vi 0.711 3.122 2.989 1.156 2.489 
t 0.375 0.358 1.097 0.294 —0.200 


“Population in Table 13.1 ; sample values in Table 13.4. 
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Fig. 13.1 The distribution of t values computed from the five random samples in Tables 
13.2 and 13.3. (Values have been grouped into intervals of 0.1.) 


values themselves. Thus i-statistics, as values computed from random samples, 
can be expected to vary. As will be shown later, this circumstance constitutes 
the foundation or background for all uses of statistical inference. The parti- 
cular operations devised by the research psychologist to answer a question, 
with the aid of statistical inference, are always superimposed on the operations 
which insure that error variability is randomly distributed and will affect the 
outcome in a random fashion. There is no pretending these procedures 
eliminate uncertainty in the evaluation of research results. The uncertainty 
is there in the form of random variation. At the same time, however, the fact 
that the uncertainty arises from random variation insures a kind of rigor for 
the evaluation in the sense that the distribution of outcomes strictly due to 
error is known. One might say that the nature of the uncertainty is well 


understood. 


Perspective 


The purpose of the demonstration of the use of a table of random digits in 
sampling was to focus the student's attention on the concrete operations of 
real sampling. To do so, within this text, it seemed most convenient to use an 
artificial population—the frequency distribution of Table 13.1. This use of 
an artificial population in a demonstration of real sampling deserves comment. 
Beginning students are sometimes confused by demonstrations involving 
artificial elements or devices. They sometimes mistakenly conclude that 
sampling in psychological research must somehow involve an artificial dis- 
tribution like the one in Table 13.1. The fact is that sampling in psychological 
research involves the selection procedures of the demonstration applied to a 
population of organisms, behavioral events, or behavioral products instead 
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of the artificial population. They also sometimes mistakenly conclude that the 
purpose of a demonstration is simply to express and to dramatize an assump- 
tion. This is not true. We repeat that the purpose is to direct attention to and 
explain the use of a table of random digits in the set of concrete operations 
Which should be employed for sampling in psychological research. 


The Random Interval Procedure 


A table of random digits can be used correctly in a variety of ways, too 
numerous to list or discuss here. One variation in procedure deserves 
mention, however, because of its convenience in certain situations. One can 
Sometimes avoid the burdensome task of actually enumerating the units of 
the population. For example, suppose one had a list of 5,000 voters and 
wanted a sample constituting 20 per cent of the total number. Taking every 
fifth name in the list would yield the desired number of subjects, but such 
à procedure is not acceptable because it cannot be subjected to a general test 
of randomness and, on any given occasion, it could be quite unsatisfactory. 
Instead of every fifth name, however, one might use a random interval, one 
determined by the succession of digits in a tested table. 

The random interval procedure involves the following steps: Adopt a 
direction for reading and choose a starting point in the table of random digits. 
Agree to read the digit “0” as “ 10.” Read the first digit and count that 
number of names beginning with the first name in the list. The last name 
Counted goes in the sample. Read the second digit and count again, starting 
with the next name in the list. The last name counted goes in the sample. 
Read the third digit from the table, count names, and so on. The number of 
names skipped will vary from 0 to 9, but, over a large number of selections, 
will average very close to four names, thus insuring that approximately one 
fifth of the names will be chosen. The procedure can be adapted to other 
Population sizes and sample sizes by reading some combinations of digits and 
disregarding other combinations. 


Other Ways of Distributing Variability Randomly 


There are two situations in which the investigator does not actually select 
a sample from a population but can and does insure the random distribution 
of certain kinds of variability over values he wishes to compute. When either 
of these two situations obtain, it is considered quite acceptable statistical 
Practice to treat values computed from the collection of observations as 
"statistics, that is, as values computed from random samples. The student 
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will undoubtedly experience some difficulty in perceiving any equivalence in 
these several approaches. One reason is that the equivalence is not exact. 
Another reason is that the equivalence of the actual procedures cannot be 
deduced mathematically. Another is that the common acceptance and 
justification of the equivalence are, to some extent, based on experience. 
Finally, there is a certain exercise of common sense and argument here which 
the student will not likely be prepared to grasp immediately. 

The first situation is one in which the investigator has a group of subjects 
which he wishes to divide into two (or more) subgroups so that the final 
difference between the subgroups constitutes a random event. He makes the 
division by sorting names into subgroups, with the sorting guided by a succes- 
sion of numbers read from a table of random digits. If he wishes to sort into 
two subgroups, he may read digits as “ odd " and “ even," and sort successive 
names in the list into odd and even subgroups, as dictated by the succession 
of digits. If he wishes to sort into three subgroups, he may read single digits, 
ignoring all but 1’s, 2's, and 3's, and sort successive names into groups 
numbered “ 1," “ 2," and “ 3." Since there is nothing to do when all but one 
subgroup is filled but put the remaining names into that subgroup, it is 
necessary to randomize the original order in which names are listed. If 
the names are put on cards, then the cards could be well shuffled initially and 
then actually sorted into piles as dictated by the random digits. 

When a group of subjects is divided randomly into subgroups, these sub- 
groups are often treated as random samples and values computed from them 
are considered to be i-statistics. One may reasonably ask, What constitutes 
the population in this case? There is no population in the sense implied by the 
original definition of the word. There is a population, however, in the sense 
of a totality of logically possible arrangements of N subjects into c subgroups 
of n subjects each. For example, if N = 20, c = 2, and n = 10, then the total 
number of logically possible arrangements is 20!/10!10! = 184,756. There 
has been some study of the relation between sampling distributions for 
i-statistics based on the theory of a normally distributed population and 
sampling distributions for i-statistics based on the theory of a population of 
arrangements. Under certain conditions, the relation is quite satisfactory and 
supports the practice just described. 

Earlier, in connection with the discussion of selecting samples from a 
population, it was said that the population was real and accessible. The 
population of arrangements is obviously different. It is real, in the sense that 
any one arrangement can actually occur, but it is not accessible, in the sense 
that all arrangements are simultaneously available. 

The second situation is one in which the investigator wishes to obtain 
observations on n subjects in succession. There are, therefore, n occasions of 


!Factorial notation was briefly explained in Chapter 2. Example: $1 — 5x 4x 3x2x 
1 = 120. 


— 
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measurement, each of which is associated with an error of some unknown 
magnitude. It is the investigator's desire to distribute errors associated with 
Occasions of measurement randomly over the subjects to be measured. To do 
so, he assigns ordinal numbers to the subjects arbitrarily and then determines 
the order of occurrence of these numbers in a table of random digits. Their 
order of occurrence is the order in which he measures the subjects. Usually 
the research requires at least one repetition of the whole procedure. Certain 
values computed from the entire collection of observations qualify as i- 
statistics. Here the population is the totality of orders into which n objects 
can be placed. The number is 7! The population of orders is real in the sense 
that any one of them can occur, but it is not accessible, in the sense that all 
orders are simultaneously available. 


Incorrect Practice 


When a tested mechanical device like a table of random digits is employed, 
random selection, random sorting, and random ordering, as well as com- 
binations of these, constitute the admissible procedures for insuring semantical 
Correspondence between theoretical sampling and practical sampling. What 
are the consequences of failure in this correspondence? When the procedure 
used has not been tested or when the investigator has done nothing to meet 
this requirement, then the consequences are unpredictable. Unintended and 
unwanted differences in the observations, and in the values computed from 
them, may be small or large, may operate in one direction or another, and 
may be consistent or inconsistent. One would not know what to expect. The 
shaded areas in the tails of the distributions of Figures 12.9, 12.10, and 12.11, 
and the form of those distributions, would be indeterminate. As a matter of 
fact, it would be more nearly correct to say these distributions do not then 
exist, 

Because of the drastic consequences when there is a failure in the sampling 
aspect of the problem of correspondence, the investigator should place this 
Issue first among his concerns. Since departures from normality in the dis- 
tribution of errors may not be serious, if he can insure that the unintended 
variability in his observations is random, he has taken a giant step toward 
Insuring correspondence between sampling theory and practice. If he can also 
insure that discontinuity and other departures from normality are negligible, 
SO much the better. The frequent practice of ignoring the requirement that 
Variability be distributed randomly and, at the same time, investing the prob- 
lem of nonnormality with exaggerated importance cannot be defended. 
um have something more to say about this problem of distributing 

ility randomly. We believe it is incorrect and indefensible for a psycho- 
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logist to call a set of measures a random sample when he has done nothing 
to insure that the set of measures has the properties of a random sample. He 
may try to justify himself by saying that the set of measures can be thought 
of as a random sample from some hypothetical population. In doing so, he is 
referring to the particular values of the set and is considering them as typical 
of some imaginary population. He is mistaken in defining a random sample 
in terms of the values in it. He is completely disregarding the method of 
obtaining the values. Actually the populations which could give rise to a 
sample containing certain specified values are innumerable. A random 
sample, theoretical or practical, is defined by the procedures of selection. 
Calling a set of measures a random sample, when nothing has been done to 
insure that the set of nieasures has been selected as it should have been, is 
simply unrealistic. 


Transformations 


Certain kinds and degrees of nonnormality can be reduced or eliminated 
by applying one of several transformations to the observations before 
i-statistics are computed. In Chapter 4, several examples were given of sets 
of numbers whose distributions were asymmetrical originally, but became 
symmetrical when the transformations were applied. The application of trans- 
formations to real measures cannot be expected to produce perfect symmetry 
as was true of the examples of Chapter 4. At best, the skewness of a distribu- 
tion may only be reduced somewhat and the final distribution may only be 
approximately normal. Despite these limitations, one should be alert for 
indications that the departure from normality is such that a proper trans- 
formation will improve the distribution and bring about closer correspondence 
between theory and practice. 


Summary 


As regards sampling theory and practice, the problem of correspondence 
has two parts: the nature of the population distribution and the properties 
of the method of selecting samples from that population. With respect to the 
first part of the problem, sampling distributions of i-statistics are affected by 
departures from normality such as discontinuity, skewness, and kurtosis. The 
effects are much less serious for certain i-statistics than for others. The effects 
become less serious as the size of the sample(s) increases. Our conclusion is 
that, under many conditions encountered in actual practice, failure in the 
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requirement of a normal distribution is not so serious as to preclude the use 
of sampling theory in the evaluation of research. With respect to the second 
part of the problem, sampling distributions do not exist when there are no 
operations which insure that variability is randomly distributed. Our con- 
clusion is that sampling theory does not apply when there is a failure in the 
requirement that variability be randomly distributed. 
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The research psychologist can never be certain that his observations of 
behavior are uncontaminated by error. No matter how careful he may be 
in planning and conducting a study, a multitude of influences, unintended and 
unwanted by the researcher, produces in his data effects of unknown magni- 
tude. These unintended differences or biases make interpretation of the results 
of research quite difficult and introduce into the interpretation a degree of 
uncertainty which cannot be eliminated. 

Sources of error in psychological research can be logically classified into 
two categories: differences which arise from the selection of subjects, and 
extraneous influences which come to bear upon an observation at the par- 
ticular time and place it is made. Because there is no way of insuring that 
these biases will be completely eliminated from the data, the results of any 
investigation are, strictly speaking, specific to that endeavor. The results 
depend on the particular subjects studied and the particular occasions on 
which observations were obtained. The research psychologist cannot, of 
course, be satisfied with this limitation. His goal is the acquisition of 
knowledge in the form of general principles and laws. 

Inductive inference, in science, is the act of reasoning in a logical fashion 
from the individual event to the universal event. It begins with a particular 


set of observations of some natural phenomenon and concludes with a 
that phenomenon or its class. It is the process 


general proposition concerning 
ith 


whereby the scientist hopes to transcend the limitations of his data w 


respect to their specificity. 

There is now wide acceptance, in the behavioral sciences, of the proposition 
that the process of induction should take place in a context of theory and 
practice provided by statistical inference. This does not mean that inductive 
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inference cannot be accomplished except through statistical procedures. 
Physical scientists have not been dependent on statistics for the high order of 
the inductive inferences which have characterized their research. There is a 
great deal more to research and the acquisition of knowledge than statistical 
manipulation of data. There is little doubt, however, that, in the analysis of 
data and in certain formal aspects of research design, the methods of 
Statistical inference represent the most rigorous logic men have yet been able 
to achieve in this connection. 

It is, of course, in the very nature of inference that it is accompanied by 
uncertainty and that it is subject to error. Statistical inference shares these 
faults. There is a sense, however, in which statistical inference can be said to 
be rigorous. In any given investigation, the probability of an incorrect 
inference is known and stated. 

Before we become involved in the specific applications of statistical inference 
in Psychological research, it would be well to consider two distinctions which 
are extremely important for an understanding of these applications. We can 
distinguish two kinds of procedures in statistical inference: the testing of 
Significance and the estimating of population values or parameters. We can 
also distinguish two kinds of research: experimental research and empirical 
Tesearch. These terms will be defined and discussed later at length. At the 
Moment, it would be appropriate to examine the relation between the 
Statistical procedures and the two kinds of research. 

Since the two statistical procedures can be employed in the evaluation of 
either kind of research, the subject matter of statistical inference and its 
applications in research can be divided into four topics representing the four 
Possible combinations of procedure and kind of research. These four topics 
are given by the following brief outline: 


A. Experimental research 

l. Testing significance 

2. Estimating parameters 
B. Empirical research 

l. Testing significance 

2. Estimating parameters 


Although the relative importance of these four topics might be debated, 
it is our opinion that a careful survey of the actual research being done by 
Psychologists and the kinds of statistical procedures being used in the evalua- 
tion of that research would reveal that testing significance occurs much more 
often than estimating parameters. There is a reasonable explanation for this 
one-sided emphasis on testing significance and some comment will be made 

Y way of explanation of the situation in Chapter 16. Furthermore, although 

Ste is probably some general inclination for research psychologists to view 
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experimental research as being more important than nonexperimental re- 
search, we take the position here that both are important and merit discussion, 
as long as they are clearly distinguished. Under the assumption that both 
kinds of research are important, but that testing of significance is generally 
of more interest to the research psychologist than estimating parameters, we 
shall emphasize the testing of significance in separate discussions of experi- 
mental and empirical research. Testing significance in experimental research 
will be taken up in Chapter 15; testing significance in empirical research will 
be considered in Chapter 16. 

Although the testing of significance will be emphasized, we do believe that 
the student should be introduced to estimation procedures. Our choice of 
examples for an introduction to estimation places the discussion in Chapter 
16, which deals with empirical research. As we have indicated, estimation has 
many applications in experimental research, but space does not permit us to 
pursue the topic that far. Consequently, nothing will be said about estimation 
in Chapter 15, which deals with experimental research. 

In labeling the two kinds of research, “ experimental " and “ empirical," 
we do not mean to imply that experimental research is not empirical in nature. 
Acquisition of experience through careful observation is an important part 
of both kinds of research. According to our definition, empirical research is 
planned investigation with the following features: careful observation, classi- 
fication or measurement, establishing relations between qualitative or 
quantitative variables, and prediction involving these variables. Experimental 
research is also planned investigation, with these features and something in 
addition. Experimental research has the scientist intervening in natural pro- 
cesses, not just observing them. He stages the events of the research. He 
introduces changes and observes the consequences of these changes. He 
manipulates one or more variables and observes the effect of the manipulation 
on another variable. He changes conditions around his subjects and measures 
the effect of the change on some aspect of their behavior. 

We have reason for being particular regarding the use of these terms. There 
is no room for loose thought and language in research. Success in acquiring 
knowledge depends to a very large extent on reducing issues to the simplest 
terms possible. If a research issue can be expressed in terms which make 
obvious the nature of a correct resolution of that issue, one is then on firm 

ground. If a research issue remains vague in one's thought and expression, 
and if its resolution is largely a matter of conjecture, then there is little chance 
for success. Rigorous thought and language should be the immediate 
objective of the student who hopes to become a research psychologist. 

There is much loose usage of the term “ experiment." Some of the worst 
usage comes about when a would-be researcher adopts the term to lend 
prestige to something he has done or wants to do. The result is that every 
kind of study has, at some time and place, been referred to as an experiment. 
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If everything is*an experiment, there is no need for the term. It becomes 
synonymous with " research." An important discrimination is thereby lost. 

There is a line of thought which connects experimental research with the 
determination of causes. While this usage comes very close to the important 
distinction on which we have been insisting, it ultimately runs afoul of the 
problem of defining “ cause ” and the many philosophical issues which that 
problem entails. Consequently, the word “ cause " becomes a liability rather 
than an asset in making explicit the difference between the two kinds of 
research. 

. The distinction between experimental and empirical research is actually a 
simple one and its importance is obvious when one takes a direct and straight- 
forward view of the research situation. The research psychologist engages in 
ene or the other of two clearly distinguishable kinds of operations. In the one 
kind of operation, he installs himself at some vantage point and observes the 
behavior of human or animal subjects in an environment whose character- 
istics he may also record. The information he acquires from observing may 
turn out to have broad generality and may be of great practical consequence. 
In the other kind of operation, he controls certain conditions, manipulates 
One or more others, and observes the effect of the manipulation on the 
behavior of his subjects. Again the information he acquires may possess 
generality and practical value. As we indicated earlier, the first operation 
Characterizes empirical research; the second operation characterizes 
experimental research. 
es long as the psychologist reports faithfully what he did and as long as his 

pretation of his results is appropriate to what he did, there is no problem. 
ifte begins when he engages in the first operation and tries to interpret 

esults as indicating what would be found out from the second operation. 
That is, he observes, and does nothing more than observe, but draws con- 
clusions from his data as to what would happen if he intervened. Now it is 
Sibus that the most certain knowledge about intervention is going to come 
Poi investigations in which intervention takes place. When the researcher 
Son, nothing more than observe, à conclusion about the effects of interven- 
tion ek be a very bad guess. If he wishes to learn what effect some manipula- 
het o conditions has on the behavior of subjects, he should perform exactly 
will Prio S Even if he actually does manipulate conditions, the results 
"m entail some degree of uncertainty because of error, but if he does not 

nipulate, the prospect is far worse. 

"es Madii over the connection between lung cancer and smoking is 
lin ie ent example of this issue. There seems to be little doubt that scientists 

Es sire a relation between the incidence of lung cancer and smoking. 
fiot A nature of this relation is, remains quite uncertain. What they have 
o a is the effect of manipulating smoking. To be exact, they have 

at the incidence of lung cancer is greater among smokers than 
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among nonsmokers. They have not observed that the incidence of lung cancer 
increases among nonsmokers when they take up smoking. Nor have they 
observed that the incidence of lung cancer decreases among smokers when 
they quit smoking. They have not shown that young people could avoid lung 
cancer by not smoking and that the same young people would contract lung 
cancer if they did smoke. They have not shown that elimination of smoking 
would eliminate the lung cancer now known to be associated with it. 

To sum up, the researcher may observe or he may manipulate and observe. 
Observation alone may yield valuable information. However, if he wishes 
to know the effect of manipulation, he must do it. He cannot claim to know 
the effect of manipulation until he does. 

There is one further complication in the issue. Although one does not find 
out about manipulation by simply observing, it is entirely possible that an 
observed relation will be confirmed in experimental manipulation. Imagine 
that one has observed a relation between two variables—an environmental 
condition and some aspect of behavior. Is it possible that changing the con- 
dition would change the behavior? Yes, but the experimental demonstration 
remains to be done. Thus the observed relation could serve as a pretext for 
an experiment in which it is demonstrated that changing the condition does 
change the behavior. 

Here is an example. Scientists observe that the incidence of tooth decay 
among children in communities having water supplies which contain natural 
fluorides is lower than the incidence in communities whose water supplies do 
not contain those chemicals. A relation is thus established between incidence 
of tooth decay and presence of fluorides in the water. Does this mean that 
they know that introducing fluorides into a water supply will reduce tooth 
decay? No, it does not. It is entirely possible that some other chemical 
associated with the natural fluorides provides the key to the situation. That 
such an action will have the desired effect remains to be established. To find 
out, these same scientists embark upon an extensive experimental project in 
which fluorides are introduced into water supplies and the effects are observed. 
The evidence indicates that the incidence of tooth decay is reduced by 
introducing fluorides into the water supply. Thus an observed relation may, 
but does not necessarily, suggest a manipulation which can be demonstrated 
to be effective. 

We have chosen “ empirical research " as the label for those investigations 
which are limited to observation and we shall reserve “ experimental re- 
search " as the label for investigations in which the psychologist actually 
manipulates one or more variables and observes the effect of the manipulation. 

It should be clear that the distinction between experimental and empirical 
research does not depend on the choice of an i-statistic for the analysis of data. 
Experimental research is distinguished from empirical research by the opera- 

tions of the investigator, but not by his choice of i-statistics. It is not 
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uncommon to encounter a student who expresses a preference for a particular 
i-statistic because, as he says, he wants his study to be an experiment. In 
thinking that the choice of a particular i-statistic can make his study an 
experiment, he has failed to understand the difference between an experiment 
and an empirical investigation. A given i-statistic may be appropriate for an 
evaluation of either kind of research. The beginning student would probably 
be astonished if we were to describe here all of the equivalent methods of 
analysis which can be applied equally well to experimental and empirical 
research data. 

Although the choice of method of analysis does not determine whether the 
research is experimental, appearances might give the student the impression 
that it does. By appearances, we mean that there is some tendency for experi- 
mental data to be evaluated by certain methods while nonexperimental data 
are evaluated by other methods. There is some tendency for experiments to 
be evaluated by means of i-statistics, such as t and F, computed from means 
and variances, There is some tendency for nonexperimental research to be 
evaulated by the same i-statistics computed from correlation coefficients and 
Other measures of association. These tendencies reflect superficial, rather than 
necessary, correspondences. 

The two major applications of statistical inference, testing significance and 
estimating parameters, will be discussed next, in general terms, before they 
are treated in detail in the following chapters. 


Testing Significance 


Observation and measurement in research are always subject to unintended 
and undesired influences which scientists call error. Consequently, in 
announcing positive results, the investigator may be mistaken no matter how 
favorable those results may appear to be. While the researcher cannot avoid 
being misled by outcomes due to error, he can be selective or discriminating 
With respect to those outcomes in two Ways. First, within the totality of 
Strictly error outcomes, he can define a class of the most favorable outcomes, 
by which he is willing to be misled. Second, he can place any desired limit 
on the size of that class. His erroneous decisions to announce positive results 
Vill then occur with a relative frequency which is known and can be stated. 

, The evaluation of research data by application of a statistical test of 
Significance requires computation of an /-statistic which embodies a com- 
Parison of interest to the investigator. The comparison is a difference, a ratio, 
d a combination of the two. To define a class of error outcomes about which 
he is willing to be mistaken and, further, to specify its size, the investigator 
Thust be able to deduce the nature of all possible outcomes for the com- 
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parison. That is, he must know the sampling distribution of an i-statistic 
whose logical properties are relevant to his purposes. Within this sampling 
distribution he can establish two classes of values: error results which appear 
least favorable, or those values of the i-statistic which are smaller than some 
criterion value; and error results which appear most favorable, or those values 
of the i-statistic which are larger than the criterion value. In evaluating his 
research data, he computes the i-statistic which embodies the comparison of 
interest to him and observes the class into which the obtained value falls. If 
it exceeds the criterion value, it belongs in the class of error outcomes about 
which he is willing to be mistaken and he announces positive results. If it is 
exceeded by the criterion value, it belongs in the class of error outcomes by 
which he is not willing to be misled and he acknowledges negative results. 
The size of the class of error outcomes which appear most favorable and 
which will be the occasion for announcing positive results erroneously is 
arbitrarily determined, but it seems reasonable and consistent with the pur- 
pose of scientific research to make the size of the class small rather than 
large. The size of the class is expressed as a proportion of the totality of error 
outcomes. A proportion of 0.05 has gained wide acceptance as being small 
enough to afford the researcher, his colleagues, and his public a reasonable 
degree of protection from false announcements of positive results. Some 
researchers, who are more demanding of themselves, adopt a proportion of 
0.01, thus reducing the size of the class of erroneous decisions considerably. 
To further focus attention on the general characteristics of tests of 
significance, we shall give here an example in which the sampling distribution 
is identified but no details are given about the investigation itself. In the 
evaluation of a certain body of research data, the i-statistic computed was F, 
a variance ratio. The sampling distribution of F for this particular set of data 
could be represented by the function in Figure 12.10. Bear in mind that the 
sampling distribution is a specification of all of the possible error outcomes 
for the investigation. The point on the horizontal axis which separates the 
unshaded area under the curve from the shaded area represents the criterion 
value, F,. The shaded area represents the class of error outcomes about which 
the investigator is willing to be mistaken. The unshaded area represents the 
class of error outcomes about which he is not willing to be mistaken. Having 
computed F, the investigator determines in which class it belongs. He does 
so by consulting a table of F such as Table B, or B, in the Appendix. If the 
obtained F exceeds the criterion value given by the table, the investigator 
announces positive results. If it does not, he acknowledges negative results. 
Classifying a given i-statistic as above or below some criterion value in the 
totality of error results is an action consistent with the viewpoint that the 
result is due to error, a viewpoint which is referred to formally as the null 
hypothesis. When the obtained /-statistic exceeds the criterion value and 
positive results are announced, the investigator is said to reject the null 
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hypothesis. When the value fails to equal the criterion and negative results 
are acknowledged, the investigator is said to accept the null hypothesis. The 
Proportion of error outcomes which are the occasions for announcing 
positive results or rejecting the null hypothesis is called the level of significance. 


Estimating Parameters 


We have said that values computed from a population are called parameters 
and that values computed from one or more random samples are called 
" -statistics." Some values computed from samples correspond to values 
Computed from populations. For example, there is the mean of a sample and 
the mean of the population from which the sample was obtained. There is the 
Variance of a sample and the variance of the population from which the 
Sample was obtained. There is the product-moment correlation for a sample 
9f paired values and the product-moment correlation for the population of 
Paired values from which the sample was obtained. Population values or 
Parameters are designated by Greek letters. Sample values or i-statistics are 


designated by Latin letters. 
; When a real population is too large to measure and describe, th 
itis too large to permit the computation of parameters, it is often 
toa researcher to obtain an estimate of one or more of the parameters. In 
Beneral terms, an attempt is made to infer values of parameters from 
"statistics, 

Estimates of parameters are divided into two broad classes: point estimates 
and interval estimates. 
Pe Point estimate is a single value which. is 

mples and which possesses certain properties a 
SUME Corresponding parameter. We shall consider only point estimates which 
Are unbiased or approximately so. An i-statistic is an unbiased estimate ofa 
Parameter when the mean of its sampling distribution is equal to the para- 
meter. It has been our experience that the beginning student is usually not 
Prepared for this definition. * Unbiased " usually has connotations for the 
Student Which go beyond the somewhat limited and arbitrary definition of 
the statistician. Examples of unbiased estimates will be given in Chapter 16. 
c An interval estimate consists of a pair of values called limits, which are 
e mputed from one or more samples. These limits, which establish an interval, 
T à sampling distribution. The interval has the property that it will a 
fot €ncompass the parameter and will sometimes fail to do so. In on 
fail x of such intervals, some known and stated proportion ihe 
sm " he size of this proportion can be fixed by the researcher and made as 

all as he desires. The size of the interval depends on the chosen proportion 


at is, when 
of interest 


computed from one or more 
which qualify it as an estimate 
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of failures. The smaller the proportion of failures, the larger the interval. 
Several interval estimates will be described in Chapter 16. 

We shall comment here briefly on our reasons for emphasizing tests of 
significance. In testing significance, when an investigator rejects the null 
hypothesis and announces positive results, he is under no obligation to report 
the size of the obtained difference or ratio in terms of the original scale units. 
On the other hand, in establishing an estimate of a parameter, it is in the very 
nature of the procedure that the size of the computed value is indicated. 
When the estimate is expressed in terms of units on a psychological scale, its 
size may be difficult to interpret because of the unknown differences among 
scales, even those intended to measure the same characteristic of behavior. 
Consequently, psychologists are not prone to emphasize the magnitude of 
differences, ratios, or other values, expressed in terms of original scale units. 
Estimation is more likely to be used when the value of interest to the 
researcher is a pure or dimensionless number like a proportion, a percentage, 
or a correlation coefficient. 

There is another reason for the emphasis on tests of significance in 
psychological research. Much of psychological research is exploratory ; 
controls are often poor; measurement is frequently of dubious quality. To 
detect a replicable difference or ratio under these circumstances is itself no 
small accomplishment. Attempting the estimation of the size of a parameter 
is often not justified. Perhaps the day will come when measurement, predic- 
tion, and control in psychological research will improve to the point that the 
emphasis will shift from testing significance to estimating parameters. 


Summary 


Applications of statistical inference in research can be placed in two broad 
classes: tests of significance and estimates of parameters. Research operations 
themselves can also be placed in two classes: experimental research and 
empirical research. It is possible to do tests of significance and estimates of 
parameters in both experimental and empirical research. In this text, testing 
significance will be emphasized. Only a brief introduction to estimation will 
be given. This introduction will be limited to applications of estimation in 
empirical research. Chapter 15 deals with tests of significance in experimental 
research. Chapter 16 deals with tests of significance and estimates of 


parameters in empirical research. 


EXPERIMENTAL 
INFERENCE 


Pragmatics | 15 


An experiment is a planned event in which the researcher manipulates one 
or more variables and determines the effects of this action by making observa- 
tions on another variable. A manipulated variable is called an independent 
variable. The variable on which observation is made is called a dependent 
variable, Although either kind of variable, independent or dependent, may be 
qualitative or quantitative in nature, we shall limit our discussion to 
quantitative variables. 

In psychological experimentation, the manipulation of an independent 
variable takes a characteristic form. The independent variable is the feature 
Which varies in a set of conditions, often called treatments, and the manipula- 
lion is the actual imposition of these conditions on groups of subjects. A set 
of conditions may consist of a number of stimuli which vary along some 
Specific dimension, a number of different levels of some characteristic of the 
environment which can be controlled by the experimenter, varied allowances 
Of some consumable substance such as Water, food, nutrients, or drugs, or 
Variations in tasks and programs prescribed by the experimenter for his 
Subjects, Listed below are four examples of independent variables described 
as sets of conditions. 

.l. Two auditory signa 
differing in frequency. 

2. Three levels of illumination for a testing room. 

3. Three different concentrations of salt in solution in drinking water. 

4. Two different amounts of practice: five trials and ten trials. 

The dependent variable in a psychological experiment is a measure of 
response of the subjects in the experiment. The subjects may be human or 
animal. Typically, subjects differ in their responses, when they are measured 
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Is consisting of pure tones equal in intensity but 
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under uniform conditions, and it is not uncommon for these differences to 
persist in repeated measurement. This is to say that pyschologists frequently 
encounter evidence of stable or reproducible differences among their subjects. 
Because subjects differ, the selection of subjects for the conditions of an 
experiment can bias the final comparisons. Thus differences among subjects 
constitute a major source of error in psychological experiments. 

Subject differences are not the only errors by which experiments can be 
biased. A multitude of influences affect the experimenter's observation of a 
subject's response under a given condition at a specific time and place in the 
conduct of the experiment. The experimenter, of course, tries to control and 
hold constant as many of these influences as he can, and to whatever extent 
he can, so that unintended differences will be eliminated and only intended 
differences will occur in the data. Intended differences are those he is trying 
to produce by means of the varied conditions of the experiment. He can never 
be certain that these efforts to control extraneous influences and eliminate 
bias are completely successful. 

The presence of error and bias in experimental data introduces uncertainty 
into the evaluation of the data. In announcing positive results for an experi- 
ment, the investigator can never be certain that he has not been deceived by 
error. The only kind of rigor he can achieve in his evaluation is the rigor of a 
test of significance in which the probability of an incorrect rejection of the 
null hypothesis is known and stated. Meaningful application of a test of 
significance requires, of course, that the distribution of errors have the 
attributes of random variation. 

Three simple plans for conducting and evaluating experiments are described 
below. Plans for experiments are often called experimental designs. 


The Completely Randomized Design 


Imagine an experiment in which we wish to determine the differential effect 
of two conditions, 4, and A». Each of the two conditions is to be imposed 
on a group of subjects, after which the subjects are to be measured on à 
response variable, Y. 

The selection of subjects and their assignment to the two conditions is à 
critical step in the conduct of the experiment. Subjects can be expected to 
vary prior to the experiment and the differences among them will act as errors 
which can bias the experimental comparison of conditions in many different 
ways, depending on the selection of subjects and on the particular arrange- 
ment of subjects into groups, produced by the assignment. 

Imagine that we have selected (2n) subjects from some large supply of 
subjects, that we have assigned them to two conditions with n subjects pet 
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condition, and that we have imposed the conditions and obtained measures 
on the dependent variable. We recognize that the particular arrangement of 
subjects which results from the selection and the assignment is only one of 
many possible arrangements. 

Table 15.1 shows the main features of the design and the way of recording 


Table 15.1 


AN EXPERIMENTAL DESIGN WITH TWO CONDITIONS 
AND TWO RANDOM SAMPLES 


Conditions 
ER En 
First Second 
Sample Sample 


the data which consist of (2n) measures. Each column of the table corresponds 
toa sample and the condition which was imposed on the subjects in that 
Sample. Notice there are no rows in the table. The double subscripts permit 
the identification of any measure in either column. 

It is obvious that the total variability in the (27) observations will contain 
three different kinds of differences. It will contain differences due to the 
effects of the treatments, to the extent there are such effects. It will contain 
differences among subjects, to the extent that preexperimental inequalities of 
response have persisted into the experiment. Finally, it will contain unin- 
tended differences arising from extraneous influences which the experimenter 
has failed to eliminate. The intended differences are often called treatment 
effects. The unintended differences are often called error. 
gon total variability in the (27) observations can be analyzed into two 
: Mponents: the between-sample sum of squares and the combined within- 
ois sum of squares. It will be interesting to consider the relation between 
" * two components produced by the arithmetic analysis of the total sum of 
EUR and the two sources of variability described above as treatment effects 
Sine ae Under certain circumstances to be described below, the Peres 
Si will be contained in the between-sample variability and nothing but 

or will be contained in the within-sample variability. 

Treatment effects can be divided logically into two classes: those effects 
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which are constant and additive and those which are not. Let us consider the 
first possibility, the case of constant and additive effects. We shall return later 
to the other possibility. A treatment effect is constant and additive when all 
subjects in a particular condition of the experiment change with the same 
increment or decrement of response. When the effects of conditions A, and 
A, are constant and additive, but different, the differential treatment effect 
is contained in the between-sample variability and does not enter the within- 
sample variability. 

The conclusion that the differential effects of treatments will not be con- 
tained in the within-sample sums of squares can be justified in the following 
manner. It was established in Chapter 4, in connection with the discussion 
of coding transformations, that adding a constant to, or subtracting a con- 
stant from, each number in a set does not change the variability of that set. 
A constant additive treatment effect can be conceptualized as a constant value 
added to, or subtracted from, the response of each subject in a sample. Con- 
sequently, the variability among the measures of response in a sample will not 
be different from what it would have been had there been no treatment. That 
is to say, the variability within a sample will not have changed during the 
experiment. Since the total variability consists only of two components, the 
between-sample variability and the within-sample variability, the effects of 
treatments must be contained in the between-sample variability. 


The between-sample sum of Squares measures differences among subjects 
who have been treated differently and, therefore, contains any differential 
constant and additive treatment effects, but that is not all. The between- 
sample sum of squares may also contain a bias resulting from the selection 
and assignment of subjects. Because of the random selection and assignment, 
the samples may not be exactly comparable at the start of the experiment. 
Thus we often say that the between-sample variability is the apparent treat- 
ment effect, by which we mean that it is the tre. 


atment effect plus some 
unknown bias. 


The error effects in the experiment consist of preexperimental differences 
among subjects and differences arising from uncontrolled and unintended 
influences Operating in the experiment. Let us focus our attention for the 
moment on that part of the error consisting of differences among subjects. 
Differences among subjects are errors which affect the comparison as a 
consequence of the selection of subjects and their assignment to the conditions 
of the experiment. 

The combined within-sample sum of Squares measures differences among 
subjects treated alike, Since constant additive treatment effects will not affect 
the variability of a sample, that variability provides a basis for gauging the 
size of errors. We often say that the within-sample variability provides an 
estimate of the bias to which the experiment is liable. 


Let us review the equivalences of the terms and concepts introduced above 
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to characterize this simple experiment involving two conditions and two 
random samples of subjects. 

The total sum of squares is a measure of the variability in all of the data. 
It contains differential treatment effects, differences due to inequalities among 
subjects, and other errors. The total sum of squares can be analyzed into two 
components. 

The between-sample sum of squares measures differences among subjects 
treated differently. It contains differential treatment effects and, possibly, a 
bias from errors. It is referred to as the apparent treatment effect. 

The within-sample sum of squares measures differences among subjects 
treated alike. It contains subject inequalities and errors of other kinds. From 
the within-sample sum of squares we obtain an estimate of the size of errors 


affecting the experiment. 


THE GUARANTEES OF RANDOM ERROR 


Two kinds of errors affect the comparisons in a psychological experiment. 
One kind consists of inequalities among subjects. These inequalities become 
errors, in the selecting and assigning of subjects with respect to the conditions 
of the experiment. The experimenter can insure that these errors are randomly 
distributed over the conditions by using a table of random numbers or some 
other tested device to make the selection and assignment. The other kind of 
errors consists of unintended differences produced by uncontrolled influences 
and associated with the particular occasion on which an observation is 
Obtained. The experimenter can insure that errors associated with occasions 
of measurement are also random in their distribution over the conditions of 
the experiment. He can do so by arranging for the random conjunction of a 
subject, a treatment, and the occasion on which the subject is measured under 
that treatment in the experiment. 


THE EVALUATION 


The random selection of subjects and the random assignment of subjects 
and conditions to occasions of measurement can produce any one of many 
different combinations of an apparent treatment effect and an estimate of 
error. That is, random selection and assignment can produce many different 
combinations of values for the between-sample variability and the within- 
sample variability. Just on appearances, the outcomes which are most favor- 
able for the announcement of positive results are those which suggest large 
treatment effects and small errors. Even though it must be admitted that these 
most-favorable-appearing outcomes could actually be due to random error, 
if one must risk being misled, it is reasonable to take one's chances on them 
rather than on the outcomes which suggest small treatment effects and large 
errors. 
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The experimenter needs a systematic procedure for detecting outcomes 
which give the most favorable comparisons of apparent treatment effect and 
error. There are two i-statistics which provide the desired comparison. Either 
one employed in a test of significance provides the desired systematic pro- 
cedure. One is t as computed by formula (12.2), which is 


Y D, . 
42V [n 
This ratio compares the apparent treatment effect, as measured by the 


difference between two sample means, with an estimate of error computed 


from the within-sample variability. The other i-statistic is F as computed by 
formula (12.4), which is 


t 


V, 

Va 

This variance ratio compares the apparent treatment effect, as measured by 
the between-sample variability with the estimate of error, as given by the 
combined within-sample variability. 

In the test of significance, the result of the comparison is classified as being 
larger or smaller than a predetermined criterion value. If the obtained value 
falls in the class of error outcomes by which the experimenter is willing to be 
misled, he rejects the null hypothesis; if it does not, he accepts the null 
hypothesis. 

With two conditions and two samples, ¢ by formula (12.2) and F by 
formula (12.4) give exactly the same result for the test of significance. As a 
matter of fact, £? = F when c, the number of samples, is two. F has the 
advantage that it can be used when the number of conditions is greater than 
two. When F is employed, the evaluation is usually reported as an analysis 


of variance, and the results are commonly represented in the form of 
Table 15.2 


F= 


Table 15.2 


GENERAL FORM OF THE ANALYSIS OF VARIANCE 
FOR A COMPLETELY RANDOMIZED DESIGN 


Component of Variability SS df y F 
Between samples or conditions B c—1 Vi VJPFELD 
Within Samples or conditions Ww N—c Vg 

Total I N-1 


B = the between-sample sum of squares, 


W = the combined within-sample sum of squares. 
T = the total sum of squares. 


NONADDITIVE EFFECTS 


Constant and additive treatment effects do not affect the estimate of error, 


The Completely Randomized Design 259 


which is computed from the combined within-sample sum of squares. Treat- 
ment effects which are not constant and additive may change the variability 
of a sample. When two or more samples of subjects are treated differently 
and the effects are not constant and additive, the result may be differences 
between the sample variances not solely due to error. There is a test of non- 
additivity for an experiment involving two conditions and two samples. The 
test of significance involves the comparison of one sample variance with 
another in an F ratio, as given by formula (12.5), 

n 


dab 3 


The test is often called a test of homogeneity of sample variances. When the 
Obtained F is significant, we often say that we have found heterogeneous 
variances. There is also a test for the case of more than two samples. The 
student will find the test described by Edwards (1), Ray (2), Walker and 
Lev (3), and others. 


AN EXAMPLE 


A psychologist obtains two random samples of rats of a particular strain. 
He administers a small, carefully measured amount of a common stimulant 
to the animals in one sample. He administers a carefully prepared placebo 
containing none of the stimulant to the other animals. He then observes the 
number of revolutions of an activity wheel each animal produces in the first 
hour after it has been fed to satiation. Table 15.3 gives the recorded-observa- 
tions. The psychologist proceeds to analyze the variability in the 20 observa- 
tions, as follows. 


Table 15.3 
ACTIVITY MEASURES FROM THE STIMULANT EXPERIMENT 


Conditions 
Placebo Stimulant 
Sample 1 Sample 2 
7 4 
13 43 
42 30 
17 47 
9 23 
20 31 
20 32 
15 34 
14 15 
22 43 


N — 20 — number of animals. 
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Summary data for the entire set of 20 measures in Table 15.3: 


ZY =481 
ZY? = 14,775 
N =20 
(ZY} _ 481? — 
C, => = a = 11568.05. 


Computation of the total sum of squares: 
ZY? — C, = 14,775 — 11,568.05 = 3,206.95 
Summary data for Sample 1: 


ZY = 179 
ZY? = 4,057 
n = 10 
Computation of the sum of squares for Sample 1: 
E 179% 
zp- 4,057 — — = 852.9 
n 10 
Summary data for Sample 2: 
ZY = 302 
ZY? = 10,718 
n = 10 
Computation of the sum of squares for Sample 2: 
ZY)? 302* 
Ey: - 7) = 10,718 — io = 1,597.6 


Computation of the combined within-sample sum of squares: 


Xy) 
EL m = d = 852.9 + 1,597.6 = 2,450.5 


Computation of the between-sample sum of squares: 
(ZY) 179? + 302? 
: — €. = 5 - — 11,568.05 = 756.45 


The fact that the total sum of Squares should be equal to the between- 


sample sum of squares plus the combined within-sample sum of squares 
provides a check on the computation. 


3,206.95 = 756.45 + 2,450.5 
The computation checks. 
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Vj, the variance for the apparent treatment effect, is computed by dividing 
the between-sample sum of squares by (c — 1) — 1 degree of freedom. 
756.45 
V, = =a S 756.45 
V. the variance for the estimate of error, is computed by dividing the 


w 


combined within-sample sum of squares by (N — c) = 18 degrees of freedom. 


, 0 2450.5 
w = 18 
The null hypothesis is that observed differences between conditions are due 
to error. Let the level of significance be 0.05. The test of significance involves 
computing F, the ratio of V; to V; or the ratio of the treatment variance to 
the error variance. 


= 136.14 


Vi 756.45 
= Vi T 136.14 
F, the criterion value of F for one and 18 degrees of freedom and the 0.05 
level of significance, is 4.41. (See Table B, in the Appendix.) Since the 
obtained F exceeds the criterion F, the psychologist rejects the null hypothesis 
and announces positive results. The results are summarized in conventional 
form in Table 15.4. 


= 5.56 


Table 15.4 
ANALYSIS OF VARIANCE FOR THE STIMULANT EXPERIMENT" 
Component SS df V F 
Apparent treatment effect 756.45 1 756.45 5.56? 
Estimate of error 2,450.5 18 136.14 
Total 3,206.95 19 


“Data in Table 15.3. : 
"Criterion value, F,, is 4.41 ; null hypothesis is rejected. 


A test for nonadditivity of treatment effects should also be applied. The 
test involving F requires the computation of a variance for each sample. Vj, 
the variance for the first sample, is obtained by dividing the sample sum of 
squares, 852.9 by (n — 1) — 9 degrees of freedom. 


852.9 
Vi =g =% 


V}, the variance for the second sample, is obtained by dividing 1,597.6 by 
(n — 1) — 9 degrees of freedom. 


3991. 
V, = A 2 e = 177.51 
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Since tables of F give ratios greater than one, we compute by dividing the 
larger sample variance by the smaller. 


UL PT 


"y 94.77 
Since this procedure of putting the larger sample variance over the smaller 
will double the number of ratios greater than one, we shall need to compensate 
by choosing a level of significance which when doubled will give us the 
desired value. We choose 0.025. The effective level of significance is then 0.05. 
The null hypothesis is that the observed difference between variances is due 
to error. F, the criterion value for nine and nine degrees of freedom, is 4.03. 
(See Table B,) The null hypothesis is accepted and the assumption of 
constant additive treatment effects is taken to be justified. 

Even if significance is obtained in the test of homogeneity of sample 
variances, there is no reason to abandon the main analysis. The consequences 
of nonadditivity are not serious. It has been shown that heterogeneity of 


sample variances has the effect of making the true level of significance slightly 
larger than the chosen level. 


F 


The Matched-Group Design 


: ; Consideration of the actual matching 
procedure, 


If we assign the members of e 
the residual errors, representi 
randomly at the same time, If 
pair are assigned randomly to t 


ach pair randomly, one to each condition, 
ng failures in matching, will be distributed 
the treatment-subject combinations for each 
he two occasions on which observation is to 


The Matched-Group Design 263 


take place, then the errors associated with occasions of measurement will also 
be randomly distributed over the conditions of the experiment. 

Imagine that these randomization procedures have been carried out, that 
the conditions have been imposed, and that measures on the dependent 
variable have been obtained. The data can be tabulated in two columns, 
corresponding to conditions A, and A», and as many rows as there are pairs 
of subjects. The general plan for two conditions and r matched pairs of 
subjects is shown in Table 15.5. Recall that the total sum of squares in a 


Table 15.5 
GENERAL PLAN FOR A MATCHED-PAIR DESIGN 


Conditions 
EY As 
1 
2 
3 
Pairs . 
24 


double classification can be analyzed into three components: the between- 
column sum of squares, the between-row sum of squares, and the residual 
Sum of squares. 

The magnitude of differences among pairs, relative to the total variability, 
Will depend on the effectiveness of the matching procedure. If the matching 
is nearly perfect, errors will be very small and most of the intersubject vari- 
ability will appear in the between-row component. If the matching is poor, 
errors will be large and much less of the intersubject variability will appear 
in the between-row component, on the average. Large or small, differences 
among pairs do not enter into the evaluation of the experiment. Since the 
experimental comparison can be made within each pair, only the failures in 
matching must be taken account of in the evaluation. The extent to which a 
given pair differs from any other pair is of no concern to us. 

Differential, constant, additive treatment effects will enter the between- 
column component and will not affect the between-row variability or the 
residual variability. Since between-row variability consists of differences 
among pairs and between-column variability contains treatment effects, the 
residual variability must provide an estimate of error. That constant additive 
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treatment effects do not affect the residual variability can be deduced, as 
follows. . . . 

Let Y; be the measure on the dependent variable for the jt" matched pair 
and the kth treatment. Now Y;, can be conceptualized as consisting of two 
parts: the level of response which would have been obtained without treat- 
ment and the increment (or decrement) of response produced by treatment. 


If we let Lj, be the level of response without treatment and d, be the 
increment or decrement produced by treatment, then 


Yir = Lg + ay. 


This equality statement should not be interpreted as meaning that the 
experimenter ever knows the values of Lj, and a,. If he actually knew the 
value of a,, there would be no point in doing an experiment. We have intro- 
duced these symbols for deductive purposes only. They do not have to be 
given numerical values to serve our purposes. Notice that a, varies only 


from column to column, but that it is a constant from row to row, within any 
column. 


The mean of the j'^ row can be written 


Y;. = Lj. + a, 
where d, is the average of the c values 
means with one another, 
TOW sum of squares, dá. 
mean of the third row 


of a, in any row. In comparing row 
à process implicit in the computation of the between- 
will cancel. For example, the difference between the 
and the mean of the fifth row can be written 


Y, Y, Dy LL. 

Thus the between-row variabilit 
pairs and will be unaffected by 
The mean of the kth 


y will be a measure of the differences between 
the treatments. 


column can be written 


Dap = Ey + Ak. 


t 
In comparing column means, a 
between-column sum of squares, 


second column will be 


Pg x = (Ly + a) — (Le + ay). 

Since the two treatment effects, 4, and a,, 
variability will contain this differential effect, 
The residual deviation for the j'^ row and the kth 


may differ, the between-column 


column is 


Te = Y.— Vy Fy ob M, 


M ,, the total mean, is equal to the average row mean or the average column 
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mean. In either case, 


M,=M, +4. 
We have said that 

Yi, = Lj + ds, 
and have shown that 

Y. =L. +ã 


and 


Substituting for Y;,, Y;, Y.,, and M, in the formula for r;, yields 
rj; = (Lir + ag) — (5. T4)-— (Es + ay) + (Mi + à). 


Removing parentheses and combining terms gives an expression for rj; in 
which a; and d. do not appear: 

rg = Lg — L; — Ly + Mi. 
We conclude that the residual variability does not contain the constant, 
additive effects of treatments, that it reflects failures in matching which have 
been randomly distributed over conditions, and that it provides an estimate 
of error. 

The variability in the data for a matched-group design can be analyzed into 
three components: the apparent treatment effect as measured by the between- 
column sum of squares, the differences among the matched groups as 
measured by the between-row sum of squares, and the error variability as 
measured by the residual sum of squares. The investigator evaluates the 
experiment by comparing the treatment variance to the error variance in a 
test of significance. He chooses a level of significance. The treatment variance, 
V, is computed by dividing the between-column sum of squares by (c — 1) 
degrees of freedom. The error variance, V;, is computed by dividing the 
residual sum of squares by (r — 1)(¢ — 1) degrees of freedom. The comparison 
involves computation of an F ratio, 


The investigator compares the obtained F with Fe, the criterion value, and 
accepts or rejects the null hypothesis. 

The general form of the conventional presentation of the results of an 
analysis of variance, applied to a matched-group design, is given in Table 15.6. 


THE MATCHING PROCEDURE 


The first step in matching is the measurement of all subjects prior to the 
experiment and under uniform conditions on a variable X which is known to 
be correlated with Y. The second step is the ranking of subjects according 
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Table 15.6 


THE ANALYSIS OF VARIANCE FOR 
A MATCHED-GROUP DESIGN 


Component SS df 4 d 
Treatments C c—1 Vt VV; 
Groups G r—i 
Error R (r — Ife — 1) Vi 

Total T N—1 


C — between-column sum of squares. 
G = between-row sum of squares. 

R = residual sum of squares. 

T — total sum of squares. 


to the magnitude of their measures on X. The third step is the pairing. The 
first and second subjects in the ranking are put in one pair; the third and 
fourth subjects are put in a second pair; the fifth and sixth are put in a third 
pair; and so on. Matching in this fashion on Y indirectly matches the 
members of each pair on Y. The effectiveness of the matching depends, of 
course, on the correlation between X and Y. 

The members of each matched group are assigned randomly to the 
conditions of the experiment. Each subject-treatment combination is then 
randomly assigned to an occasion of measurement. 

When the number of conditions is greater than two, the matched groups 
must be correspondingly larger. Thus for an experiment involving three con- 
ditions, each matched group would consist of three subjects. Subjects would 
be ranked on the preexperimental variable, X. The first three subjects in the 
ranking would then be chosen for the first group; the next three subjects 
would be placed in the second group; and so on. 


AN EXAMPLE 


A psychologist wishes to determine the effect of paying subjects for their 
participation in an experiment. He chooses a measure of visual acuity as the 
dependent variable. He obtains a sample of 20 male, freshman college 
students. He measures all of the student subjects on the test of visual acuity 
under standard conditions prior to the experiment. He next ranks the 20 
subjects on this preliminary measure and then forms 10 matched pairs. The 
members of each pair are then assigned randomly to the two conditions: Ay, 
no payment for participation in the experiment; 4,, payment of one dollar 
per hour for participation. Subjects in condition A, are informed they will be 
paid. Subjects in A, are not told they will be paid. Measures of visual acuity 
are obtained on all subjects. To insure the maintenance of good public 
relations, subjects in A, are paid, after the experiment is over. 

The data for the visual acuity experiment are given in Table 15.7. Measures 
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are expressed in millimeters. The analysis and evaluation of the data are 
described below. 

Table 15.7 


THE MATCHED-PAIR EXPERIMENT 
ON VISUAL ACUITY 


Conditions 
Ai As 


2 
3 
4 
Pairs of 5 
Subjects 6 
7 
8 
9 
10 


Summary data: 
Let r be the number of rows; c, the number of columns; N — 


total number of observations. 
Let Y be the observation in any cell; R, the sum of any row: C, the sum of 
any column; XY, the sum of all values; EY?, the sum of squares of all 


original values. 


(rc), the 


r— 10 c=2 N — 20 
SY —284 Xy? = 4,166 
SR? = 8,274 XC? = 40,378 

284? 
Cy = ^3 74038 


Computation of the total sum of squares: 
Xy? — C, — 4,166 — 4,032.8 — 133.2 


Computation of the between-column sum of squares: 


ZC? . 40,378 


— — Cy = Fg — 4,032.8 = 5.0 
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Computation of the between-row sum of squares: 


2 8,274 
ZR Ge 5 
D. 


4,032.8 — 104.2 


Computation of the residual sum of squares: 
Let rj, be the residual deviation for the jt row and the kt" column. Then 


Tik Y, — Y; — Y. 4- M, 
ny = 19 — 18.5 — 13.7 + 142 = +1 
ror = 14 — 165 — 13.7 + 14.2 = —2 


ll 


Ta = 0 Ja = +2 
fa = +1 ra = 0 
fa = 0 mg ==] 
fa ==] "ss = 0 
™ ——l Fe = +l 
ra = 0 Toe, = hl 
To. = +2 ‘Te = 0 
To. = 0 "on = —2 
fy ==] rw: = 0 
Ar = 24 


The fact that the total sum of squares should equal the combination of the 
ree components provides a check on the computation. We observe that 


133.2 = 5 + 104.2 + 24. 
The computation checks. 
Table 15.8 shows the sums of s 


th 


quares, degrees of freedom, variances, and 
ance. The null hypothesis is that differences 


ndom error. The criterion value for the 0.05 
level of significance and one and nine degrees of freedom is 5.12. (See Table B; 


in the Appendix.) The obtained value of F, 1.87, is not significant. The null 
hypothesis is accepted. The researcher does not announce positive results; he 


may decide to abandon this line of research or to try to improve the 
experiment and repeat it. 


between conditions are due to ra 


Table 15.8 
ANALYSIS OF VARIANCE FOR THE MATCHED-PAIR DESIGN? 
Component SS df V F 
Treatments 5.0 I 5.0 1.87^ 
Groups 104.2 9 
Error 24 9 2.67 
Total 133.2 19 


“Data in Table 15.7. 


^F... the criterion value for the 0.05 level of significance and one and nine degrees of 
freedom, is 5.12. The null hypothesis is accepted. 
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A Design with a Correlational Adjustment 


Given observations on a dependent variable, Y, for two random samples 
on which different conditions have been imposed, we can present the data 
as shown in Table 15.1. This table was used earlier in the discussion of the 
completely randomized design. We can use the table again as the starting 
point for our discussion of a design with a correlational adjustment. 

We know that differences among the observations in Table 15.1 may be 
due in part to differences which existed among the subjects prior to the 
experiment and which persisted through the experiment. We could adjust 
the observations on Y, for the purpose of eliminating these initial differences, 
if we had available measures obtained on the subjects prior to the experiment. 
We could then eliminate that part of the variability in Y which was accounted 
for by variability existing prior to the experiment as measured by X, a 
variable correlated with Y. By doing so, the size of the bias in the apparent 
treatment effect would be reduced, on the average, and the estimate of that 
bias would also be reduced a corresponding amount, on the average. The 
result would be an experiment subject to errors which were smaller than those 
of a completely randomized design. 

To employ a correlational adjustment, the following operations are 
necessary. Obtain two random samples of subjects from a supply. Measure all 
Subjects on .Y under uniform conditions. Impose the two conditions, A, and 
Ag, one condition on each sample. Measure the subjects on Y, the dependent 
variable. Table 15.9 gives the general form in which the measures can be 
arranged. 

Table 15.9 
GENERAL FORM OF THE DESIGN INVOLVING 
A CORRELATIONAL ADJUSTMENT 


Conditions 
Ay Az 
Sample 1 Sample 2 


Ss = Subjects. 


Consider a new way of analyzing the variability in the two samples of 
Paired measures represented in Table 15.9. 
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Let r; be the product-moment correlation between Y and Y over the entire 
table, disregarding the classification by samples. Then r7 is the proportion of 
Y variability correlated with X, and (1 — r?) is the proportion not correlated 
with X. 

Let Zy? be the total variability in Y as given by the sum of squares of 
deviations from M,, the total mean. The total variability includes treatment 
effects, differences among subjects, and other errors. 

Then r? Xy? represents the predicted variability, a sum of squares reflecting 
differences among subjects to the extent that these differences can be pre- 
dicted from X. Furthermore, (1 — r?) £y? represents the unpredicted variabi- 
lity, a sum of squares containing treatment effects and errors other than the 
predictable differences among subjects. It will be convenient to represent this 
unpredicted sum of squares by the expression, (T + Ej). T stands for treat- 
unpredicted sum of squares 
S expression is not meant to 


nalysis by defining another product- 
an be computed from the data. Let rw 
ation between Y and Y, That is, 


moment correlation coefficient which c. 
be the combined within-sample correl 


correlation between Y and 
each variable, 


Then r2 is the Proportion of within- 


the within-sample X variability, and (1 
with X. 


sample Y variability correlated with 
— ri) is the proportion not correlated 
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: Note that ZZ? is the combined within-sample variability in Y. It includes 
differences among subjects treated alike and other errors, but does not 
include treatment effects when they are constant and additive. 

It follows that r2.Z Xy? is the within-sample predicted variability, a sum of 
Squares measuring differences among subjects, to the extent that these 
differences can be predicted from X. Furthermore, (1 — r2)2 2? is the 
within-sample unpredicted variability, the sum of squares which provides an 
estimate of error. Let E, be this error sum of squares. It can be computed 
and used to estimate E, the error component or bias in (T + Ej). 

By the subtraction, 

(T + E) — Ey, 
we obtain a quantity which we shall call Tpiasea, the apparent treatment 
effect. Thiasea is a sum of squares which measures the treatment effect as it 
may have been biased by error and we therefore call it the apparent treatment 
effect. We can evaluate Tpiasea in a test of significance. 

The analysis and evaluation are summarized in Table 15.10. This kind of 


Table 15.10 
GENERAL FORM OF THE ANALYSIS OF COVARIANCE 


Component SS df V F 
Treatment effect — Tyyinsea c—1 Vi Vive 
Error Ew N—c—1 Vi 


Thinsea = sum of squares for apparent treatment effect. 
Ew = sum of squares for estimating error. 


analysis, in which the original data for the dependent variable are adjusted 
With respect to preexperimental differences among subjects as measured by 
a variable X, is often referred to as an analysis of covariance. 

It is appropriate to test the sample variances on Y for evidence of non- 
additivity of treatment effects. It is also appropriate to test the homogeneity 
of sample correlation coefficients as providing further evidence on the non- 
additivity of treatment effects. These tests for nonadditivity will be illustrated 
in the example which follows. 

The effectiveness of the covariance design in reducing the size of errors 
depends on the magnitude of the correlation between X and Y. If this cor- 
relation is zero, there is no point in doing an analysis of covariance. When 
the correlation is zero, one discards the preexperimental measures and does 
a simple analysis of variance on the Y measures. One's decision about the 
magnitude of the correlation is based on a test of significance of the within- 
sample predicted variability. A predicted variance, V;, can be computed from 
the within-sample predicted sum of squares, r2,2Zy?, by dividing it by one 
degree of freedom. F is the ratio of V; to V}, the error variance from the 
main analysis of Table 15.10. 
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AN EXAMPLE 
Table 15.11 contains experimental data which will be analyzed and 
evaluated by the analysis of covariance. The data were obtained from an 


experiment in which the psychologist wanted to determine the effect of 
practice on scores for a mechanical assembly test. The experimenter obtained 


Table 15.11 
DATA FOR THE ASSEMBLY-TEST EXPERIMENT 


Conditions 


1 2 
Sample | Sample 2 


Qo a ROO 00 M — — M 


X 
4 
3 
4 
3 
5 
6 
5 
3 
3 
3 


[LUC ANUANN 


e 
SOooc-o0unrumn- 


OQ o 00 -4 Ov ta 4 WN — 
ta dod RR OR UD C2 UD Cà Go X 


Ss — subjects. 
X = preexperimental test score. 
Y = dependent variable test score. 


two samples of subjects and measured all subjects on a different assembly test 
under uniform conditions prior to the experiment. He then imposed the 
practice conditions, which were one hour for the first sample and two hours 
for the second sample. Subjects practiced on the initial assembly test. He then 
administered the assembly test which yielded the scores on the dependent 
variable. A score on either test indicated the number of assemblies completed 
in a five-minute period. 


Summary data for the entire set of measures: 


EX =79 ZY = 148 
ZX? = 329 ZY? = 1,334 
ZXY = 617 
N = 20 


Computation of the total sums of squares and products: 


(Zxy 79 
2 = v uL cA nmm = 
Zx 2X "= 329 20 = 16.95 
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SY% 1482 
xp = zy: —! s - 
y y - CT = 1,334 o = 2388 
XXZY (79148) 
Bie ( 
xy -Zzxv-457--6nm 2 324 


The adjustment of the total sum of squares, Zy?, to obtain (T + Ej), the 
sum of squares of errors of prediction, is defined by the expression 

a — max 
which can be written as 


Zw 32.4? 
D rgy — yl 22 238.8 S 176.867. 


(T + Ej), the sum of squares of errors of prediction based on the total sums 
of squares and products, is 176.867. 


Summary data for Sample 1: 


LA = 39 EY = 59 
ZX = 163 ZY? = 459 
XXY = 252 
n= 10 
Computation of sums of squares and products for Sample 1: 
aX} 39? 
E 2X? C d" 163 10.9 
n 10 
2 2 
Pu ZY? — Cs — 459 9s = 110.9 
n 10 
39)(59 
Axy = ZXY we - 252 (3959) 21.9 
n 10 
Summary data for Sample 2: 
ZX —40 ZY =89 
AX? = 166 AYy*-— 875 
ZXY = 365 
n= 10 
Computation of sums of squares and products for Sample 2: 
ZXP 40? 
Ax? Ax? is e 166 -= 6 
n 10 
ZYF 892 
t ) 82.9 


2 2 
sy = ZY —~ = 813 —Fp 


EXXXY , _ 40189) , 


Zxy = XY = 36 10 
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Combining of sums of squares and products for the two samples: 
ZZXx = 10.9 + 6 = 16.9 


ZXy = 110.9 + 82.9 = 193.8 
ZZxy = 21.9 +9 = 30.9 


The adjustment of the within-sample sum of squares, YLy?, to obtain Ew, 
the sum of squares of errors of prediction based on the within-sample sums 
of squares and products, is defined by the expression, 

(1 — nzzxy, 
which can be written as 


_ 2oF 


ZXy —pBEXy = Xx 33x 


E, is computed, as follows: 


3 


0.92 
193.8 — 169 ^ 137.302 


Thus £,, the sum of squares of errors of prediction based on the within- 
sample sums of squares and products, is 137.302, 

Tbiasea, the sum of Squares for the apparent treatment effect, is the 

difference, 

(T + E) — E, = 176.867 — 137.302 — 39.565. 

Table 15.12 gives Tpiasea, E, 


w degrees of freedom, variances computed from 
the two sums of squares, and the F ratio comparing the apparent treatment 


Table 15.12 
ANALYSIS OF COVARIANCE FOR THE ASSEMBLY-TEST EXPERIMENT" 


Component SS df V F 
Treatment 39.56 1 39.56 4.90» 
Error 137.30 17 8.08 


“Data in Table 15.11. 
°F, for the 0.05 level of significance and one and 17 degrees of freedom is 4.45; the null 
hypothesis is rejected. 


effect with the estimate of error. 
treatment effect is due to error. F, 
and 17 degrees of freedom is 4.45, ( 


The null hypothesis is that the apparent 
for the 0.05 level of significance and one 
See Table B, in the Appendix.) Since the 


obtained value of F, 4.90, exceeds F., the null hypothesis is rejected and the 
investigator announces positive results, 


The main evaluation of the assembly- 


test experiment has been described 
above. The test of Significance of the 


correlation between the dependent 
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variable, Y, and the adjusting variable, X, will be described next. This order 
of events is followed here because of its convenience in explaining the 
analysis. In actual practice, the test of significance of the correlation between 
X and Y would precede the main evaluation. If the correlation proved to be 
nonsignificant, an analysis of covariance would not be used for the main 
evaluation. 

The correlation to be tested is rw, which is based on the within-sample 
sums of squares and products. A convenient way of performing the test is to 
compute the within-sample predicted sum of squares, 


rosy’, 
which can be written 
(2Zxy)? 
Saxe 
and computed, as follows: 
30.9? 
Ta 7 56.50 


The predicted variance, V}, is computed by dividing the predicted sum of 
squares by one degree of freedom. Thus 


The test of significance requires the computation of F, the ratio of the 

predicted variance, V, to the error variance, V^, from Table 15.12. That is, 
V, — 56.50 
F = y: = 808 ^ 6.995. 

The null hypothesis is that the observed correlation between X and Y is due 
to error. F, for the 0.05 level of significance and one and 17 degrees of 
freedom is 4.45, (See Table B, in the Appendix.) Since the obtained value 
exceeds the criterion value, the null hypothesis is rejected. The finding of a 
significant correlation between Y and Y justifies the use of analysis of 
covariance in the main evaluation, which has already been described. 

There are two tests for nonadditivity of treatment effects. The first involves 
the comparison of one sample variance with the other. The comparison of 
sample variances is carried out only for the dependent variable, since the 
treatments could not have had any effect on X. The variance for the first 
sample is 


10.9 

^ =" = 12.32. 
The variance for the second sample is 
, _ 829 

i= 5 = 9.21. 
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The ratio of the larger variance to the smaller is 
Vi 1232. 
921 — 


V3 

Since we have compared the larger with the smaller, we must choose a value 
of F, which is exceeded by 0.025 of values in the sampling distribution to 
achieve an effective level of significance of 0.05. The null hypothesis is that 
the observed difference between the two sample variances is due to error. 
The criterion value, F., for nine and nine degrees of freedom is 4.03. (See 
Table B, in the Appendix.) Since F is less than F,, the null hypothesis is 
accepted. 

The second test for nonadditivity is a test of the homogeneity of the sample 
correlations. We compute £;, the sum of squares of errors of prediction for 
Sample |, as follows: 


_ sy QO = 21.9? 
E, = J — “a = 110.9 705 = 66.90. 


For Sample 2, the sum of squares of errors of prediction is 


_ Quy 


2 mm 9? =m 
E, = D — a = 82.9 — 5 = 09.4, 


E; the combined sum of squares of errors of prediction from the 
individual samples, is computed, as follows: 


E; = E, + E, = 66.90 + 69.4 = 136.30. 
The difference, 


E, — E; — 137.30 — 136.30 — 1.00, 


is a measure of the heterogeneity of the sample correlations. It can be 


evaluated by comparing its variance, V5, computed with (c — 1) = I degree 


of freedom, with V7, a variance computed from E; with N — 2c = 20 — 4 = 


16 degrees of freedom. Thus 


, B.—EQ 100 
ches c—1 P l, 
y E 13630 
N-2.^ j6 = 8:52, 
and 
Vi, 1.00 
Fein 22S W. 
y: 3.52 0.12 


The null hypothesis is that the observed difference between sample correla- 
tions is due to error, F, for the 0.05 level of significance and one and 16 
degrees of freedom is 4.49. (See Table B, inthe Appendix.) The null hypothesis 
is accepted. Since sample variances and sample correlations were not found 


to differ significantly, the assumption of constant and additive treatment 
effects is taken to be justified. 
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Summary 


Three designs for simple experiments have been presented in the present 
chapter. The first design required that subjects be selected randomly, and that 
subjects and treatments be assigned randomly to occasions of measurement. 
There is no provision in this first design for reducing the size of errors to 
which the experiment is liable. Consequently, when subjects are heterogeneous 
and uncontrolled influences have large effects, this completely randomized 
design may yield data containing large biases. The second design required 
that subjects be measured, under uniform conditions prior to the experiment, 
on a variable related to the dependent variable. Subjects are ranked and 
formed into matched groups. Subjects within each matched group are then 
assigned randomly to conditions and to occasions of measurement. To the 
extent that the matching variable and the dependent variable are related, the 
matching will be successful and the errors will be small. The third design 
required the random selection of subjects and random assignment to con- 
ditions and occasions of measurement, with measurement prior to the experi- 
ment on a variable known to be correlated with the dependent variable. After 
the conditions are imposed on the random groups and after measures on the 
dependent variable have been obtained, the resulting data are adjusted by 
computational procedures which take advantage of the correlation between 
the two variables. 

An example was given for each design. The examples involved only two 
conditions. The designs can, of course, be enlarged to incorporate any number 
of conditions. For examples in which the number of conditions is greater than 
two, the student might wish to consult a text on experimental design by 
Ray (2) or others. 

In this discussion of experimental research, only tests of significance were 
considered. No attention was given to the estimation of parameters. It is 
Possible to compute, from experimental data, estimates of means, variances, 
and differences between means, but the procedures of estimation are seldom 
used in psychological experimentation. One reason is the arbitrary nature of 
the units in many psychological scales. Another reason is that much of 
psychological experimentation is exploratory and detecting the differential 
effect of two or more treatments is itself quite an accomplishment. 


REFERENCES 
I. Edwards, A. L., Experimental Design in Psychological Research (New York: 
Rinehart, 1950). . 
2. Ray, W. S., An Introduction to Experimental Design (New York: Macmillan, 
1960). 
3. Walker, H. M., and Lev, Joseph, Statistical Inference (New York: Holt, 1953). 


INFERENCE IN EMPIRICAL RESEARCH 


Pragmatics | 16 


In what we choose to call empirical research, the psychologist observes, 
establishes relations among variables, and makes predictions based on those 
relations. He does not deliberately interfere with or change natural processes 
for the purpose of determining the consequences of such action. Experi- 
mental research is, of course, a more ambitious undertaking than empirical 
research. Experimental knowledge bestows the power to control natural 
events. Because men in practical affairs place high value on experimental 
knowledge, empirical studies are often valued less highly than experiments. 
Nevertheless, empirical research can produce knowledge which makes it 
possible to predict natural events. It is often the only kind of research which 


the psychologist is permitted to undertake by society. In many areas of human 
behavior, experimentation is out of the question. 


Statistical inference is employed in the evaluation of empirical research 
data in both of its major forms: testing significance and estimating para- 
meters. It is not possible to present all of the specific applications of sig- 
nificance-testing and estimation in empirical research. We have chosen only 


a few of the more important tests and estimation procedures for discussion 
here. 


Tests of Significance 


Three examples of tests of significance as they might be applied to data 
from empirical investigations are described below. 

1. A psychologist wishes to know whether the student bodies of two large 
universities differ in verbal fluency. It is not practical to test all of the students 
so he obtains a random sample of 100 students from each university and 
administers the test to students in both samples under standard conditions. 
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He computes the mean of each sample, the difference between means, the 
combined within-sample sum of squares, and the combined within-sample 
variance. He then computes t by formula (12.2), 

D; 
= Javan 

In the formula, D, is the difference between sample means, V, is the combined 
within-sample variance, and n is the number of subjects per sample. V is 
computed by dividing the combined within-sample sum of squares by 
(2n — 2) degrees of freedom. The null hypothesis is that any observed 
difference between means is due to random error. He sets the level of sig- 
nificance at 0.05 and then refers to a table of 1 to classify his result. (See 
Table A in the Appendix.) If his obtained value of t exceeds te, positive, or 
is exceeded by f, negative, he rejects the null hypothesis and announces 
positive results. If it neither exceeds nor is exceeded, he accepts the null 
hypothesis and acknowledges negative results. 

2. The psychologist in the first example may also decide to compare the 
two sample variances. Doing so would serve two purposes. First, the test of 
the difference between means provided by formula (12.2), applied to two 
samples from different populations, requires that the populations have equal 
variances. If he compares the two sample variances in a test of significance, 
he will obtain evidence bearing on this requirement. In addition, he may 
actually be interested in comparing the dispersions of the two student bodies. 
Not all of the information one might desire in comparing two groups is 
necessarily contained in a comparison of means. To compare sample 
variances, he computes F by formula (12.5), 


Vi is computed by dividing a sample sum of squares by (n — 1) degrees of 
freedom. V is computed in the same way. The null hypothesis is that any 
Observed difference between the two sample variances is due to error. Since 
tables of F usually give only ratios greater than one, it is a general practice 
to place the larger variance in the numerator and the smaller in the denomi- 
nator. The effect is to double the number of values of F greater than F, for 
a given level of significance. To compensate and achieve a level of sig- 
nificance of 0.05, the psychologist chooses a value of F, exceeded by 0.025 
of the values in the sampling distribution. (See Table B, in the Appendix.) 
He then compares the computed F with F,, and accepts or rejects the null 
hypothesis. 

3. An educator has decided to investigate the relation between verbal 
ability and grades in the junior high schools of a large metropolitan area. He 
Obtains a random sample of junior high-school students and tests them on 
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verbal ability. He also obtains their average grades for the preceding term. 
He computes the product-moment correlation coefficient for the sample and 
from that value computes t by formula (12.3), 


bons ryNN —2. 
Vl — ri, 
In the formula, r+, is the sample correlation; N is the number of subjects in 
the sample. The null hypothesis is that any observed correlation is due to 
random error. He determines the criterion value of t for the 0.05 level of 


significance and (N — 2) degrees of freedom, and accepts or rejects the null 
hypothesis. (See Table A in the Appendix.) 


Estimating Parameters 


Within the context of the present discussion, an i-statistic is a value 
computed from one or more random samples. A corresponding value com- 
puted from the population is called a parameter. When a population is too 
large to be measured and described, that is, when it is too large to permit the 
computation of parameters, it is often of interest to the researcher to obtain 
an estimate of one or more of the parameters. In general terms, an attempt 
is made to infer values of parameters from sample values. 


Estimates are divided into two large classes: point estimates and interval 
estimates. They will be discussed in that order. 


POINT ESTIMATES 


A point estimate is a single value computed from one or more samples. 
It possesses certain properties which qualify it as an estimate of the corre- 
sponding parameter. Although several different kinds of point estimates can 
be distinguished, we shall consider only point estimates which are unbiased 
or approximately so. 

Consider any i-statistic, 


Co the corresponding parameter, and the sampling 
distribution of the i-statisti 


c. When the mean of the sampling distribution is 
equal to the parameter, any obtained value of the i-statistic is said to be an 
unbiased estimate of the parameter. When the mean of the sampling distribu- 
tion does not equal the parameter, the i-statistic is said to be a biased 
estimate. 

Four examples of unbiased estimates are given below. 


1. M,, the sample mean, is an unbiased estimate of I4 the population 


mean. That is, M,, the average of all possible sample means, is equal to the 
population mean, p. 
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. 2. D,, the difference between the means of two samples from two popula- 
tions, is an unbiased estimate of the difference between the two population 
means, 
. Bi — Be 
That is, 
D, = p — Be 

In other words, the average of the totality of differences between sample 
means is equal to the difference between the two population means. 

3. V, the sample variance computed with (n — 1) degrees of freedom is 
an unbiased estimate of the population variance, o°. That is, 


Vict 
or, in words, the average of all possible sample variances, when each one is 
computed with (n — 1) degrees of freedom, is equal to the population 


variance. 

The student may be interested in knowing that V,, the sample variance 
computed with n, is a biased estimate of the population variance, since the 
average of all such sample values is less than the population variance. In 


Symbols, 


Y, « ot. 
It can be shown that 
2 n— à 
y. = 0* «95. 


x n 


lation coefficient is 


4. When p,, the population product-moment corre ] 
n coefficient, is an 


zero, then r,,, the sample product-moment correlatio 
unbiased estimate of the parameter, since the mean of the sampling distribu- 


tion of r,, is zero. Symbolically, 


Fry = Pry = 0. 
zero, Izy iS a biased estimate. An 


When the population correlation is not d es 
when it is not zero, is given by 


approximately unbiased estimate of ps, 
(l — rey) . 


tev + Ten (NL — 1) 


INTERVAL ESTIMATES 


To obtain an interval estimate of a given parameter, one computes, from 
One or more samples, two values which are called limits. These limits establish 
an interval which is known to vary with sampling. That is to say, the end- 
Points of the interval have a sampling distribution. The interval has the 
Property that it will sometimes encompass or bracket the parameter and will 
sometimes fail to do so. The size of the interval can be established by the 
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researcher so that, in the totality of such intervals, some known and stated 
proportion of them will fail to bracket the parameter. By his choice of the 
size of the interval, the researcher can make the proportion of failures as 
small as he desires. In any given attempt, of course, he has no way of knowing 
whether the interval has succeeded or failed. He anticipates that, in a large 
number of such attempts, the interval will fail to bracket the parameter with 
the relative frequency he has intended. 

The logic of the procedure whereby one arrives at an interval estimate is 
illustrated below with an example involving the estimation of a population 
mean from the mean of a sample. As we indicated earlier in the discussion 
of formula (12.1), the i-statistic, 


is distributed as t. It will be convenient to let S, = JV1n. 
Then 


It follows that 
tS, = M, — p. 


The quantity, (M. — p), can be thought of as the distance of M, from p. 
Notice that t is negative when M. < p. and positive when M, > p. 

We shall choose a particular value of ¢ and designate it as t,, where the 
subscript refers to the proportion, P. The value of t, is chosen so that P/2 
of the values of t in the sampling distribution are less than t, when it is 
negative, and P/2 of the values of ¢ in the sampling distribution are greater 
than f, when it is positive. 

. Consider now the relation between 1S, and t,S,. In the sampling distribu- 
tion of tS., P/2 of the values will be less than t,S, when it is given a negative 
sign, and P/2 of the values will be greater than t,S,, when it is given a positive 
sign. That this relation holds can be verified by observing that 


tS, < tS, 

whenever j iai 
ts 
and that i 

tS, > tS. 

whenever & iie 
ios dy! 


Recall that tS, = M, — p = the distance of M, from pu. We conclude that 
the distance of a sample mean from the population mean varies with sampling 
and that it relates to t,S, just as does (S,. This is to say that P/2 of the values 
of (M, — p) in the sampling distribution of such distances will be less than 
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t,S,, when it is taken to be negative, and P/2 of the values will be greater 
than 1,,S,, when it is taken to be positive. We can take account of both classes 
of values by saying that the absolute value of (M, — p) exceeds the absolute 
value of t,S, in P of the totality of sample values. 

Now let M, represent any point on the scale of values for sample means. 
We can construct an interval on this scale around the point, M;. The limits 
of the interval will be 

M. — tySe 
and 

M. + tSo 
The interval will have the following characteristics. When M; < p, the 
interval will fail to bracket or encompass y: for any sample whose mean isa 
distance from the population mean greater than f,S,, a circumstance which 
is true for P/2 of samples. When M, > p, the interval will again fail to 
bracket for any sample whose mean is more distant from the population 
mean than t,S,, a characteristic of P/2 samples. Thus the interval, 


My x tSo 
will fail to bracket p in P of all samples. The interval is called an interval 
estimate of the population mean, p. The value, M, — tpSe is called the 
lower limit of the estimate. The value, M; + 1,S,, is called the upper limit, 
Considered together, the two values are called confidence limits. The value 


of P is called the level of confidence. . 
Figure 16.1 shows graphically how interval estimates of the population 


Sample 1 


Sample 2 

ITA Ss 

er E je inm Q9 
L uw M. U 


Sample 3 


Sample 4 
ELE SS 
MEME a. a nl 
Lu Mz U 
Sample 5 
p] 
———— e ae —XÁ 
L p M: U 


Fig. 16.1 Interval estimates of the population mean, x, computed from five random 
samples from the same population. M is the sample mean; L is the lower limit of the 
interval; U is the upper limit of the interval. Notice how the locations and the widths of 


the intervals vary from one sample to another. 
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mean can vary from one sample to another. In the figure, pz is the population 
mean, M, is the sample mean, L is the lower limit of the interval, U is the 
upper limit. Notice that the location of the interval and the width of the 
interval vary. The population mean does not vary. In one of the five examples 
of Figure 16.1, the estimate fails to bracket the population mean. 


AN EXAMPLE 


An instructor in physical education wishes to determine the average pulse 
rate for college freshmen who have been enrolled in special gymnastics 
classes for a period of eight weeks. He obtains a sample of 50 students from 
a total enrollment of 2,300. The mean of the 50 determinations of pulse rate 
is 80. The variance, Vj, is 4.5. S., which is equal to /V2/n, is 0.30. For 49 
degrees of freedom and the 0.05 level of confidence, t, = 2.01, a value 


obtained from Table A by linear interpolation. The lower limit for the interval 
estimate is 


M, — t,S, = 80 — (2.01)(.30) = 79.397. 
The upper limit is 


M, + t,S, = 80 + (2.01.30) = 80.603. 


Thus the estimate is the interval from 79.397 to 80.603. The instructor knows 
that 5 per cent of the totality of such estimates fail to bracket the parameter. 


INTERVAL ESTIMATE OF THE VARIANCE 


The problem is to establish an interval which fails to bracket the population 
variance, o°, for a proportion, P, of the totality of samples of size n. 
It can be shown that the ratio, 


(n — DV 


c? 


is distributed as a chi square function identified b 
(See Table C 


follows that 


1 y (n — 1) degrees of freedom. 
2 in the Appendix.) The symbol for “ chi square" is x?. It 


Let us specify a value of chi Square, yj, such that P/2 of values of chi square 


in the sampling distribution are less than the specified value. Then it follows 
that 


x x 
à —D mei 
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for P/2 values in the sampling distribution. Since the left member of this 
inequality is equal to the ratio, V;/c*, we can substitute the ratio and obtain 


L^ xi 


o ^(n—1) 
for P/2 values in the sampling distribution. By multiplication and division on 
both sides of the inequality, we obtain 

— DA 

(n : We eco 

xi 
for P/2 values in the sampling distribution. Let us call the quantity on the left 
U for it will be the upper limit in the interval estimate. Note that U is less 


than o? in P/2 samples. 
Let us now specify a second value of chi square, x5, such that P/2 values of 
x? in the sampling distribution are greater than the specified value. It follows 


that 


x X 
e-20^&-1 


2 
2 


for P/2 values in the distribution. Again we can substitute the ratio of V; to 


o? on the left and obtain 
Vi x 
o > (n — 1) 
for P/2 values in the sampling distribution. By multiplication and division on 
both sides of the inequality, we obtain 
(n = De: > 0 
Xs 


ution. We shall call the quantity on the 
f the interval estimate. Note that L is 
hat L < U because x} < X 

1 which will fail to bracket c? in 
from a population. 


for P/2 values in the sampling distrib 
left L for it will be the lower limit o 
greater than o? in P/2 samples. Note also t 

The two limits, L and U, form an interval w 
P samples from the totality of samples of n units 


AN EXAMPLE 

Free association times to verbs as stimulus words were determined for a 
sample of 30 male adults. The mean of the 30 response times was 1.5 seconds. 
The unbiased estimate of the population variance was 0.04 seconds. The 
investigator proceeded to establish an interval estimate with a level of 
confidence of 0.05 by computing the following quantities: 

(n — DV; _ Q9X004 _ 9 979, 
E 16.0 


2 


x 
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(0— DV: — (290.04) 
Xi 457 


Values of chi square were obtained from Table C, in the Appendix. 


= 0.025. 


AN INTERVAL ESTIMATE OF THE PRODUCT-MOMENT 
COEFFICIENT 


Because a product-moment correlation coefficient is a dimensionless 
number not influenced by certain arbitrary scale differences, psychologists 
frequently use interval estimates of that coefficient. The problem is to obtain 
limits for an interval which will be computed from rzy, the sample product- 
moment correlation coefficient, and used to estimate Pxy, the population 
coefficient. 

When pzy, the population correlation, is not zero, the distribution of r,,, 
is skewed. Its distribution has been tabulated for sample sizes of 25 or less.! 
For larger samples, it has been shown that 


1 1 + Prey 
oe GT 
is approximately normally distributed with variance, approximately 1/(N — 3). 
The problem is to estimate Z, which is a function of pz,, as indicated by 


1+ Poy | 
"1— Pry 
We determine 6,,, the standard normal deviate appropriate for the level of 
confidence, P. (See the last line of Table A in the Appendix.) Then the limits 
are found in the following manner: Compute the sample correlation 
coefficient, r,,. Compute Z, by the transformation formula given above. 
Compute S, = I//N — 3. The lower limit is then given by the expression 


s. d Ss 


1 
b = 5 log 


The upper limit is 
Z,-P8,8. 


These confidence limits define an interval which will fail to bracket the 
parameter in approximately P of the totality of estimates. 

It is usually desired to express the limits in terms of correlation. Tables are 
available which permit one to read values of a correlation coefficient corres- 
ponding to the upper and lower limits on the scale of Z,. These tables can 
also be used for the transformation of r,, to Lint 


See F. N. David, Tables 
College, London. 


*See R. A. Fisher and Frank Yates, Statistical Tables for Biological, 7 
: 4 gical, Agricultural and 
Medical Research, 3rd ed. (Edinburgh and London: Oliver and Boyd, 1943). : 


of the Correlation Coefficient, Biometrika Office, University 
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AN EXAMPLE 
A public opinion pollster obtains a sample correlation of 0.60 for the 
relation between income and favorableness of attitude toward a certain 
candidate for public office. N, the number of respondents in his sample, is 


103. For a level of confidence equal to 0.05, 5, is 1.96. The value of Z, 
corresponding to r,, = 0.60 is 0.69. S. is 0.1. The lower limit for the interval 


estimate of € is 
0.69 — (0.1)(1.96) = 0.49. 


The upper limit is 
0.69 + (0.1)(1.96) = 0.89. 


Converted to the correlation scale, the confidence limits are 0.455 and 0.710, 
approximately. 


FREQUENCY 
CLASSIFICATIONS 


ET 


Univariate and bivariate frequency distributions have figured prominently 
in the preceding chapters. A univariate distribution expresses the relation 
between frequency and the magnitude of a single variable. A bivariate distri- 
bution expresses the relation between frequency and the magnitudes of values 
for two variables. Another important kind of research data takes the form 
of frequencies in classes or categories which are distinguished qualitatively 
rather than quantitatively. 

The simplest form of frequency classification involves one set of two 
categories. For example, a psychologist may obtain a sample of N respondents 
in a public opinion survey. As one part of the interview conducted with each 
respondent, a question is asked and an answer of “ Yes" or “ No" is 
obtained. He computes the proportion, p of those answering '* Yes," The 
i-statistic, p, is an unbiased estimate of the population proportion, 


Interval Estimate of a Proportion 


If the psychologist wishes to establish an interval estimate of the population 
Proportion, he has a choice of three methods: computing exact limits, reading 
limits from a table, and computing approximate limits. Computing exact 
limits is somewhat difficult, unless the sample is quite small, and will not be 
described here. Tables by Clopper and Pearson (1), Fisher and Yates (3), and 
Mainland (4) are available. In these tables, one can locate the sample propor- 
tion and read directly the confidence limits for some specified level of 


confidence. The procedure for computing approximate limits will be given 
below. 
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The sample proportion, p, is computed and then transformed to an angle 

by the formula, 
8 = sint p. 

To compute 0 by means of this formula, one obtains the square root of the 
proportion and reads from a table of trigonometric functions the value of the 
angle whose sine is the computed square root. The variance of the sampling 
distribution of 0 is approximately equal to 821/N. Values of 0 are approxi- 
mately normally distributed. The lower limit of the interval estimate of the 
parametric angle is 
821 
0 —98, [— 
where 8, is the standard normal deviate for the desired level of confidence. 
(See the last line of Table A in the Appendix.) The upper limit is 


TS 
The approximation resulting from use of the inverse sine transformation is 
considered very good when the parametric proportion lies between 0.05 and 
0.95 and when N = 20. Tables have been made available by Fisher and 
Yates (3) for converting proportions directly to angles and for converting 
angular limits to limits on the proportion scale. 


Single Set of More Than Two Categories 


It may be that the psychologist obtains more than two responses to an 
interview question. For example, the subject may be asked to respond by 
answering “ Yes," “ No," or * Undecided." These responses, in effect, 
classify the N subjects into three categories. It is possible to test the signi- 
ficance of the differences among the three categories in terms of the dis- 
crepancies between the three observed frequencies and the average of the 
three frequencies. The test of significance involves the i-statistic, chi square, 
which is computed, as follows. . 

The observed frequency, fo, is determined for each of the three categories- 
The average frequency, f. is given by Z/f,/c, where c is the number of categories 
and, in the example under consideration, is equal to 3. The average frequency 
is, of course, also given by N/c. A discrepancy, (f, —f), is computed for each 
category. Each of these discrepancies is squared. Finally, the sum of the 
c squared deviations is computed. The i-statistic, chi square, is the sum of the 
squared deviations divided by the average frequency. That is, 

1 = 
2 —LXf,-—fY. 
es (s) 
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The null hypothesis is that variation in the observed frequencies is due to 
sampling. . 

The psychologist chooses a level of significance and consults a table of chi 
square to determine the criterion value of chi square for (c — 1) degrees of 
freedom. (See Table C, in the Appendix.) If the obtained value exceeds the 
criterion value, he rejects the null hypothesis. If it does not, he accepts the 
null hypothesis. 


The Significance of a Difference 
Between Proportions 


When the researcher has obtained two samples from different populations 
and has determined the proportion of subjects in each sample who respond 
in a specified way, he may wish to examine the difference between the two 
proportions and make an inference about the difference between the two 
populations. The procedure for the test of significance is described below. 

À proportion is computed for each sample. Let us use p, for the first sample 
and p, for the second sample. The corresponding angles will be denoted 0, 
and 0,. 0, is determined by computing the square root of pı and referring 
the root to a table of trigonometric functions for the angle whose sine is that 
Square root. 0, is determined in the same way. The i-statistic used in the test 
of significance is a ratio involving the difference between the two angles and 
the value, /821/N, + 821/N,, where N, and N, are sample sizes. The ratio 
is distributed approximately as 8, the standard normal deviate. The formula 
for à is 


0, — 6, 


JB2I/N, + 821/N, 
The null hypothesis is that any observed difference between the two 
proportions is due to sampling. 
A level of significance is chosen and the criterion value of 6 is obtained 
from a table of standard normal deviates. (See the last line of Table A in the 


Appendix.) If the absolute value of the obtained 8 exceeds the criterion value, 
the null hypothesis is rejected. 


Joint Frequency Classifications 


It is often of interest to a psycholo 
population are jointly classified byt 
categories be labeled R,, Ra, R 


gist to determine how the units of some 
wo sets of categories. Let the first set of 
9s Ri... , R, and the second set, C, 
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Co, Cs, C. C.. Joi ifyi i 
2 Cy ss. Ci... , Ce Jointly classifying units of the populati i 
: : g pulation requires 
i (re) subcategories defined by forming all combinations of the type c. 
" H wx iw: 1 m 
t is convenient to represent the joint classification, in general form, by a 
rectangular table of r rows and c columns, such as the one in Table 17.1 


Table 17.1 
CATEGORIES AND SUBCATEGORIES FOR A JOINT CLASSIFICATION 


C Categories 
Ci Cs Cs . Cy : c. 


Categories i 


Each cell of the table represents a subcategory formed by combining one 
category from each of the original sets given on the margins of the table. We 
shall assume for purposes of discussion that an entire population can be 
classified by the subcategories of Table 17.1. 

The purpose of investigating the joint classification would be to ascertain 
whether or not there was a relation between the two ways of classifying. No 
relation between the two ways of classifying is taken to mean that the relative 
frequency distributions of all columns are the same and that the relative 
frequency distributions of all rows are the same. When these conditions 
obtain, we say that the two ways of classifying are independent. We mean 
simply that the relative frequency distributions of either classification do not 
vary. In still other words, the differences among column relative frequencies 
do not vary with rows. 

If we let 
the population relative frequency for category Co 


tion relative frequency for category R;, and 
ubcategory R;Cr, 


Da = 
Dj. = the popula 
Dj; = the population relative frequency for s 
bution for the population can be 
2. When the two ways of classifying 
lation between them, 


then the joint, relative frequency distri 
represented, in general form, by Table 17. 
are independent, that is, when there is no re 


Pik = DP 
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When this equality does not hold for all (rc) cells for the population joint 
classification, we say that the two ways of classifying are not independent 
and that there is a relation between them. 


Table 17.2 
THE JOINT RELATIVE FREQUENCY DISTRIBUTION FOR A 
POPULATION 


Relative Frequencies for the C Categories 
Par D.» P.a . Px 


Relative 


Frequencies 


for the R 


Categories 


Pr, 


When the population is too large to be classified in its entirety, the 
investigator may obtain a random sample of N units from the population, 
classify the units in the sample into the subcategories of an (r x c) table, and 
make an inference from the sample to the population regarding the relation 
between the two ways of classifying. 

The test of significance which can be applied to the sample frequency data 
requires the computation of chi Square, as indicated by the formula: 


TL OE? 
de^ uw. 


> 


where f, is the observed frequency in each subcategory and Ja the so-called 
expected frequency, is computed from the marginal frequencies and the total 
frequency, as indicated by the expression, f;f,/N. The summation has (re) 


terms, consisting of one value from each cell. The number of degrees of 
freedom is (r — Ive — 1). 
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chi square in the subset form the sampling distribution on which the test of 
significance is based. 


AN EXAMPLE 


The example we shall give would be classified, in our system, as an empirical 
research study. 

A psychologist is hired to determine whether political affiliation is related 
to position on a current issue, where the position of an individual subject is 
indicated by a response of “ Yes " or “ No” to an interview question. A 
random sample of 150 registered voters is obtained and they are jointly 
classified as being Democrat or Republican and as having answered “ Yes" 
or “ No” to the question. The joint classification for the sample is given in 
Table 17.3. The computation of the expected frequency for each cell is shown 
in the table. 

Table 17.3 


JOINT CLASSIFICATION OF VOTERS 


Democrat Republican Sj. 


fy = 75 fy = 25 
Yes 
f, = (100)(90)/150 = 60 | fe = (100)(60)/150 = 40 
fy = 15 fo = 35 50 
No 
f, = (50)(90)/150 = 30 | fe = (50)(60)/150 = 20 
p 90 60 N — 150 


J, = observed frequency. 
Ja = expected frequency. 


The null hypothesis is that any discrepancies between observed and 


expected frequencies are due to sampling. . : 
The i-statistic, chi square, is computed by finding the difference between 


the observed and expected frequency for each cell, squaring that difference, 
dividing the squared difference by the expected frequency, and summing 
these four quotients for the four cells of the 2 x 2 table. The computation is 
given below. 
. (75-60)? , Qs-40* , (15-30) | GÀ-20* _ 28,125 
~ 60 40 30 20 

The criterion value of chi square for (r — D(c—D-1 degree of nd 
and the 0.05 level of significance is 3.841. (See Table C, in the Appen v 
The null hypothesis is rejected and the investigator announces the finding o 
à relation between political affiliation and position on the issue. 

Since the chi square function is a continuous curve and the values com 


2 


puted 
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from frequency data are always discrete values, the actual sampling distribu- 
tion is only approximated by the continuous function. For large samples, the 
disparity between the two distributions is negligible. For small samples, the 
disparity can be serious. In 2 x 2 classifications, it is possible to improve the 
approximation by reducing the absolute value of each difference, (f, — fe), 
by 0.5 before computing chi square. The effect of this correction is to reduce 
the size of chi square. If one wishes to avoid the problem of discontinuity, 
one should apply the exact test described by Fisher (2). 
It is interesting to note that 
2 = Ng? 


where ¢ is the phi coefficient, the product-moment coefficient for two 


dichotomous variables, when each has been coded to or assigned values of 
O and 1. 
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APPENDIX 


TABLE A“ 


Values of t exceeded by 0.025 of values in the sampling distribution. 

In testing the significance of a difference between two sample means, the number 
of degrees of freedom is 2(n — 1) where n is the number of units per sample. 

In testing the significance of a product-moment correlation coefficient, the number 
of degrees of freedom is (N — 2) where N is the number of pairs in the sample. 

In computing an interval estimate of the population mean, the number of degrees 
of freedom is (n — 1) where n is the number of units in the sample. 

When the number of degrees of freedom is indefinitely large, t is equal to 5, the 
standard normal deviate. Therefore, the value of à exceeded by 0.025 of values in 
the sampling distribution is given in the last line of the table. 

If values of t and 6 are given negative signs, then these va 
values in their respective distributions. 


lues exceed 0.025 of 


t 
12.706 
4.303 
3.182 
2.776 
2.571 
2.447 
2.365 
2.306 
2.262 
10 2.228 
11 2.201 
2.179 


Statistical Tables for Biological, 
d., Edinburgh, by 


& 


v0 300 RU It-— 


“Table A is abridged from Table III for Fisher and Yates, 
Agricultural and Medical Research, published by Oliver & Boyd Lt 


Permission of the authors and publishers. 
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TABLE A continued 


13 2.160 
14 2.145 
15 2.131 
16 2.120 
17 2.110 
18 2.101 
19 2.093 
20 2.086 
21 2.080 
22 2.074 
23 2.069 
24 2.064 
25 2.060 
26 2.056 
27 2.052 
28 2.048 
29 2.045 
30 2.042 
40 2.021 
60 2.000 
120 1.980 
oo 1.960 


5Note that ¢ = ô, the standard normal deviate, when the number of degrees of freedom 
is indefinitely large. 


Appendix 


Values of F exceeded by 0.05 of the values in the sampling distribution. 


TABLE B;* 
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im table can be used, when the 0.05 level of significance is desired, in testing 
ifferences between means, correlation, and homogeneity of sample correlations in 


analysis of covariance. 


df 

for 

Variance 

in 
Denominator 


Interpolation may be performed usin 


“This table is abridged from M. Merrington 
centage Points of the Inverted Beta 
here with permission o 


1 


7.71 
6.61 
5.99 
5.59 
5.32 
5.12 
4.96 
4.84 
4.75 
4.67 
4.60 
4.54 
4.49 
4.45 
441 
4.38 
4.35 
4.32 
4.30 
4.28 
4.26 
4.24 
4.17 
4.08 


df for Variance in Numerator 


2 


6.94 
5.79 
5.14 
4.74 
4.46 
4.26 
4.10 
3.98 
3.89 
3.81 
3.74 
3.68 
3.63 
3.59 
3:55 
3.532 
3.49 
3.47 
3.44 
3.42 
3.40 
Bae 
3.32 
3.23 


Distribution,” 
f the Trustees of Biometrika. 


3 


6.59 
5.41 
4.76 
4.35 
4.07 
3.86 
3.71 
3.59 
3.49 
3.41 
3.34 
3.29 
3.24 
3.20 
3.16 
3.13 
3.10 
3.07 
3.05 
3.03 
3.01 
2.99 
2.92 
2.84 


4 


6.39 
5.19 
4.53 
4.12 
3.84 
3.63 
3.48 
3.36 
3.26 
3.18 
3.11 

3.06 
3.01 

2.96 
2.93 
2.90 
2.87 
2.84 
2.82 
2.80 
2.78 
2.76 
2.69 
2.61 


5 


6.26 
5.05 
4.39 
3.97 
3.69 
3.48 
3.33 
3.20 
3.11 
3.03 
2.96 
2.90 
2.85 
2.81 
2.71 
2.74 
2:21 
2.68 
2.66 
2.64 
2.62 
2.60 
2799 
2.45 


6 


6.16 
4.95 
4.28 
3.87 
3.58 
3.37 
3.22 
3.09 
3.00 
2.92 
2.85 
2.79 
2.74 
2.70 
2.66 
2.63 
2.60 
2.57 
2.55 
2.53 
2.51 
2.49 
242 
2.34 


7 


6.09 
4.88 
4.21 
3.79 
3.50 
3.29 
3.14 
3.01 
2.91 
2.83 
2.76 
2.71 
2.66 
2.61 
2.58 
2.54 
2,51 
2.49 
2.46 
2.44 
2.42 
2.40 
2.33 
2.25 


g reciprocals of the degrees of freedom. 


and C. M. Thompson, “Tables of Per- 
Biometrika, 33: 73 (1943). It is printed 
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TABLE B.* 
Values of F exceeded by 0.025 of the values in the sampling distribution. 
If, in testing the homogeneity of two sample variances, the larger variance is 
placed over the smaller, the number of ratios greater than any given value, equal to 


or greater than unity, is doubled. Therefore use of the values tabled below, in 
comparing two sample variances, will provide a 0.05 level of significance. 


df for Larger Variance (numerator) 
4 5 6 7 8 9 10 12 15 20 


4 9.60 9.36 9.20 9.07 8.98 8.90 8.84 8.75 8.66 8.56 
5 7.39 7.15 6.98 685 6.76 6.68 6.62 652 6.43 6.33 
df 6 623 5.99 5.82 5.70 5.60 5.52 546 537 527 5.17 
for 7 5.52 529 5.12 4.9 4.90 482 4.76 4.67 457 4.47 
Smaller 8 5.05 4.82 4.65 4.53 443 4.36 4.30 4.20 4.10 4.00 
Variance 9 4.72 4.48 4.32 4.20 4.10 4.03 3.96 3.87 3.77 3.67 


(denomi- 10 4.47 424 4.07 3.95 3.85 3.78 3.72 3.62 3,52 3.42 
nator) 12 412 3.89 3.73 3.61 3.51 344 3.37 328 3.18 3.07 
15 3.80 3.58 3.41 3.29 320 3.12 3.06 2.96 2.86 2.76 
20 3.51 3.29 3.13 3.0 2.91 2.84 2.77 2.68 2.57 246 


Interpolation may be performed using reciprocals of the degrees of freedom. 


“This table is abridged from M. Merrington and C. M. Thompson, *“ Tables of Per- 


centage Points of the Inverted Beta Distribution," Biometrika, 33: 73 (1943). It is published 
here with permission of the Trustees of Biometrika. 
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TABLE C;* 


Values of chi square exceeded by 0.05 of values in the sampling distribution. 

In using chi square for a test of significance on a single set of c categories, the 
number of degrees of freedom is (c — 1). In using chi square for a test of significance 
on a joint classification of r rows and c columns, the number of degrees of freedom 
is (r — DXc — 1). 


df x? 
1 3.841 
2 5.991 
3 7.815 
4 9.488 
5 11.070 
6 12.592 
7 14.067 
8 15.507 
9 16.919 

10 18.307 

11 19.675 

12 21.026 

13 22.362 

14 23.685 

15 24.996 

16 26.296 


A. Fisher, Statistical Methods for Research 


“Table C, is abridged from Table III of R. eth 
946, Edinburgh, by permission of the author 


Workers, published by Oliver & Boyd Ltd., 1 
and publishers. 
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TABLE C; 


Two chi square values are given below for each number of degrees of freedom. 
The value in the left column, x7, exceeds 0.025 of the values in the sampling distribu- 
tion. The value in the right column, x2, is exceeded by 0.025 of the values in the 
sampling distribution. 

In using these values to establish an interval estimate of the population variance, 
the number of degrees of freedom is (n — 1) where z is the number of measures in 
a random sample. 


df gt x 
4 0.484 11.1 
5 0.831 12.8 
6 1.24 14.4 
7 1.69 16.0 
8 2.18 17.5 
9 2.70 19.0 
10 3.25 20.5 
11 3.82 21.9 
12 4.40 23.3 
13 5.01 24.7 
14 5.63 26.1 
15 6.26 27.5 
16 6.91 28.8 
17 7.56 30.2 
18 8.23 31.5 
19 8.91 32.9 

20 9.59 34.2 

21 10.3 35.5 

22 11.0 36.8 

23 11:7 38.1 

24 124 39.4 

25 13.1 40.6 

26 13.8 41.9 

27 14.6 43.2 

28 15.3 44.5 

29 16.0 45.7 

30 16.8 47.0 


"Table C, was abridged from C. M. Thompson, “ Tables of Percentage Points of the 
Incomplete Beta Function and of the Chi Square Distribution," Biometrika, 32: pp. 188-189 
(1941), It is printed here with permission of the Trustees of Biometrika. 
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Reproducibility, 101-103, 181-182 
Research, 245-249 
empirical, 246, 278-300 
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Sampling 
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operations, 230-238 
practice, 226-243 
theory, 201-209 
Sampling distributions, 210-225 
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chi square, 213 
8, 220-221 
difference between means, 213 
F, 218-220 
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t, 215-217 
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problems in, 9-11 
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Signed numbers, 21-24 
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Standard deviation, 70 
Standard normal deviate, 220-221 
Summation sign, 56-67 
definitions, 56-57 
in equations, 63-65 
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simplification rules, 60-61 
Syntactics, 3 
algebra, 25-55 
analysis of variability, 184-200 
arithemetic, 17-25 
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partial and multiple correlation, 160-172 


problems in, 8-9 

product moments, 128-159 
sampling distributions, 210-225 
sampling theory, 201-209 
summation sign, 56-67 


Tabled Functions, 221-225 
chi square, 299-300 
ô, 296 
F, 297-298 
t, 295-296 
Test construction, 181-182 
Testing significance, 249-251 
empirical research, 278-280 
experimental research, 253-277 
frequency classifications, 289-294 
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and nonnormality, 242 
coding, 85-90, 147-152 
deviation from the mean, 69 
special, 90-91, 152-155 
standard score, 71-72 


Validity, 104-106, 182 
Variance 
between-sample, 213-215 
distribution of, 212-213 
of a sample, 212-213 
within-sample, 213-215 
Variance of a set of numbers, 70-71 
computing formula, 71 
properties of, 77-78 
Variance ratio, 259 
distribution of, 218-220 


significance of, 258-259, 268, 271, 274- 
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